(19) 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 



(12) 



(ID EP 1 189 437 A2 

EUROPEAN PATENT APPLICATION 



(43) Date of publication: 

20.03.2002 Bulletin 2002/12 

(21) Application number: 01121833.6 

(22) Date of filing: 1 1 .09.2001 



(51) IntCI. 7 : H04N 5/76 



(84) Designated Contracting States: 


(72) 


Inventors: 


AT BE CH CY DE DK ES Fl FR GB GR IE IT LI LU 


• 


Ferman, Ahmet Mufit 


MC NL PT SE TR 




Vancouver, WA 98683 (US) 


Designated Extension States: 


• 


Errico, James H utter 


AL LT LV MK RO SI 




Portland, OR 97229 (US) 




• 


Van Beek, Petrus 


(30) Priority: 14.09.2000 US 232377 P 




Camas, WA 98607 (US) 


30.03.2001 US 823377 








(74) 


Representative: Muller . Hoffmann & Partner 


(71) Applicant: SHARP KABUSHIKI KAISHA 




Patentanwalte 


Osaka 545-8522 (JP) 




Innere Wiener Strasse 17 
81667 Munchen (DE) 



(54) System for management of audiovisual recordings 



(57) A system for collecting and describing usage 
history information of one or more users in a much im- 
proved manner is provided. The improved system al- 
lows non-intrusive observation of user actions over a pe- 
riod of time, enabling collection of consumption-related 



data without explicit user input. The period of time may 
be specified, modified, or otherwise dynamic, as de- 
sired. The collected usage history provides a list of the 
actions carried out by the user over a period of time, if 
desired, as well as statistical information with respect to 
the content descriptions, if desired. 
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Description 

BACKGROUND OF THE INVENTION 

[0001] The present Invention relates to a system for managing audiovisual information. 

[0002] Video cassette recorders (VCRs) may record video programs in response to pressing a record button or may 
be programmed to record video programs based on the time of day. However, the viewer must program the VCR based 
on information from a television guide to identify relevant programs to record. After recording, the viewer scans through 
the entire video tape to select relevant portions of the program for viewing using the functionality provided by the VCR, 
such as fast forward and fast reverse. Unfortunately, the searching and viewing is based on a linear search, which may 
require significant time to locate the desired portions of the program(s) and fast forward to the desired portion of the 
tape. In addition, it is time consuming to program the VCR in light of the television guide to record desired programs. 
Also, unless the viewer recognizes the programs from the television guide as desirable it is unlikely that the viewer will 
select such programs to be recorded. 

[0003] RePlayTV and TiVo have developed hard disk based systems that receive, record, and play television broad- 
casts in a manner similar to a VCR. The systems may be programmed with the viewer's viewing preferences. The 
systems use a telephone line interface to receive scheduling information similar to that available from a television 
guide. Based upon the system programming and the scheduling information, the system automatically records pro- 
grams that may be of potential interest to the viewer. Unfortunately, viewing the recorded programs occurs in a linear 
manner and may require substantial time. In addition, each system must be programmed for an individual's preference, 
likely in a different manner. 

[0004] Freeman et al., U.S. Patent No. 5,861 ,881 , disclose an interactive computer system where subscribers can 
receive individualized content. 

[0005] With all the aforementioned systems, each individual viewer is required to program the device according to 
his particular viewing preferences. Unfortunately, each different type of device has different capabilities and limitations 
which limit the selections of the viewer. In addition, each device includes a different interface which the viewer may be 
unfamiliar with. Further, if the operator's manual is inadvertently misplaced it may be difficult for the viewer to efficiently 
program the device. 

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 

[0006] FIG. 1 is an exemplary embodiment of a program, a system, and a user, with associated description schemes, 
of an audiovisual system of the present invention. 

[0007] FIG. 2 is an exemplary embodiment of the audiovisual system, including an analysis module, of FIG. 1 . 

[0008] FIG. 3 is an exemplary embodiment of the analysis module of FIG. 2. 

[0009] FIG. 4 is an illustration of a thumbnail view (category) for the audiovisual system. 

[0010] FIG. 5 is an illustration of a thumbnail view (channel) for the audiovisual system. 

[0011] FIG. 6 is an illustration of a text view (channel) for the audiovisual system. 

[0012] FIG. 7 is an illustration of a frame view for the audiovisual system. 

[0013] FIG. 8 is an illustration of a shot view for the audiovisual system. 

[0014] FIG. 9 is an illustration of a key frame view the audiovisual system. 

[0015] FIG. 10 is an illustration of a highlight view for the audiovisual system. 

[0016] FIG. 11 is an illustration of an event view for the audiovisual system. 

[0017] FIG. 12 is an illustration of a character/object view for the audiovisual system. 

[0018] FIG. 13 is an alternative embodiment of a program description scheme including a syntactic structure de- 
scription scheme, a semantic structure description scheme, a visualization description scheme, and a meta information 
description scheme. 

[0019] FIG. 14 is an exemplary embodiment of the visualization description scheme of FIG. 13. 

[0020] FIG. 15 is an exemplary embodiment of the meta information description scheme of FIG. 13. 

[0021] FIG. 1 6 is an exemplary embodiment of a segment description scheme for the syntactic structure description 

scheme of FIG. 13. 

[0022] FIG. 17 is an exemplary embodiment of a region description scheme for the syntactic structure description 
scheme of FIG. 13. 

[0023] FIG. 1 8 is an exemplary embodiment of a segment/region relation description scheme for the syntactic struc- 
ture description scheme of FIG. 13. 

[0024] FIG. 19 is an exemplary embodiment of an event description scheme for the semantic structure description 
scheme of FIG. 13. 

[0025] FIG. 20 is an exemplary embodiment of an object description scheme for the semantic structure description 
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scheme of FIG. 13. 

[0026] FIG. 21 Is an exemplary embodiment of an event/object relation graph description scheme for the syntactic 
structure description scheme of FIG. 13. 

[0027] FIG. 22 is an exemplary embodiment of a user preference description scheme. 

[0028] FIG. 23 is an exemplary embodiment of the interrelationship between a usage history description scheme, 
an agent, and the usage preference description scheme of FIG. 22. 

[0029] FIG. 24 is an exemplary embodiment of the interrelationship between audio and/or video programs together 
with their descriptors, user identification, and the usage preference description scheme of FIG. 22. 
[0030] FIG. 25 is an exemplary embodiment of a usage preference description scheme of FIG. 22. 
[0031] FIG. 26 is an exemplary embodiment of the interrelationship between the usage description schemes and an 
MPEG-7 description schemes. 
[0032] F 
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FIGS. 44A and 44B are an exemplary embodiment of constrained-AND combinatorial operator. 
FIGS. 45A and 45B are an exemplary embodiment of the operators. 
FIGS. 46A and 46B are an exemplary embodiment of the operators. 
FIGS. 47A-47C is an exemplary embodiment of is an example of a mapping table. 
FIG. 48 is an exemplary embodiment of selected combinatorial operators. 
FIG. 49 is an exemplary embodiment of cloning. 
FIG. 50 is another exemplary embodiment of cloning. 

FIG. 51 is a diagram illustrating the use of a filter agent in combination with user preference descriptions, 
audiovisual content, and a user agent. 

52 is a diagram illustrating different elements and their interrelationships. 

53 is a diagram of preference values. 

54 is a diagram of an exemplary test case evaluation. 

55 is a single branch OR'ing combination. 

56 is a single branch more is better OR'ing combination. 

57 is a single branch just slightly more is better OR'ing combination. 
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is a merging of sublists shown in FIGS. 71 and 72 into a combined list. 

is a comparing various presence values across hierarchy AND'ing combination. 

is an OR'ing non-preferences in main branch combination. 

is an OR'ing non-preference qualified combination. 

is an implementation of FIG. 76. 

is an implementation of a non-preference score first, any presence yields lower ranking. 

is an alternative implementation of a non-preference score first, any presence yields lower ranking. 

an implementation of a non-preference filter-first, any presence yields rejection. 

is a context diagram for usage history. 

is a diagram for the collection and representation of usage history information, 
is a diagram of entity-relationships for usage history description schemes. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 



15 [0089] Many households today have many sources of audio and video information, such as multiple television sets, 
multiple VCR's, a home stereo, a home entertainment center, cable television, satellite television, internet broadcasts, 
world wide web, data services, specialized Internet services, portable radio devices, and a stereo in each of their 
vehicles. For each of these devices : a different interface is normally used to obtain, select, record, and play the video 
and/or audio content. For example, a VCR permits the selection of the recording times but the user has to correlate 

20 the television guide with the desired recording times. Another example is the user selecting a preferred set of prese- 
lected radio stations for his home stereo and also presumably selecting the same set of preselected stations for each 
of the user's vehicles. If another household member desires a different set of preselected stereo selections, the pro- 
gramming of each audio device would need to be reprogrammed at substantial inconvenience. 

[0090] The present inventors came to the realization that users of visual information and listeners to audio information, 

25 such as for example radio, audio tapes : video tapes, movies, and news, desire to be entertained and informed in more 
than merely one uniform manner. In other words, the audiovisual information presented to a particular user should be 
in a format and include content suited to their particular viewing preferences. In addition, the format should be dependent 
on the content of the particular audiovisual information. The amount of information presented to a user or a listener 
should be limited to only the amount of detail desired by the particular user at the particular time. For example with the 

30 ever increasing demands on the user's time, the user may desire to watch only 1 0 minutes of or merely the highlights 
of a basketball game. In addition, the present inventors came to the realization that the necessity of programming 
multiple audio and visual devices with their particular viewing preferences is a burdensome task, especially when 
presented with unfamiliar recording devices when traveling. When traveling, users desire to easily configure unfamiliar 
devices, such as audiovisual devices in a hotel room, with their viewing and listening preferences in a efficient manner. 

35 [0091] The present inventors came to the further realization that a convenient technique of merely recording the 
desired audio and video information is not sufficient because the presentation of the information should be in a manner 
that is time efficient, especially in light of the limited time frequently available for the presentation of such information. 
In addition, the user should be able to access only that portion of all of the available information that the user is interested 
in : while skipping the remainder of the information. 

40 [0092] A user is not capable of watching or otherwise listening to the vast potential amount of information available 
through all, or even a small portion of, the sources of audio and video information. In addition, with the increasing 
information potentially available, the user is not likely even aware of the potential content of information that he may 
De interested in. In light of the vast amount of audio, image, and video information, the present inventors came to the 
realization that a system that records and presents to the user audio and video information based upon the user's prior 

45 viewing and listening habits, preferences, and personal characteristics, generally referred to as user information, is 
desirable. In addition, the system may present such information based on the capabilities of the system devices. This 
permits the system to record desirable information and to customize itself automatically to the user and/or listener. It 
is to be understood that user, viewer, and/or listener terms may be used interchangeability for any type of content. 
Also, the user information should be portable between and usable by different devices so that other devices may 

50 likewise be configured automatically to the particular user's preferences upon receiving the viewing information. 

[0093] In light of the foregoing realizations and motivations, the present inventors analyzed a typical audio and video 
presentation environment to determine the significant portions of the typical audiovisual environment. First, referring 
to FIG. 1 the video, image, and/or audio information 10 is provided or otherwise made available to a user and/or a 
(device) system. Second, the video, image, and/or audio information is presented to the user from the system 12 

55 (device), such as a television set or a radio. Third, the user interacts both with the system (device) 12 to view the 
information 10 in a desirable manner and has preferences to define which audio, image, and/or video information is 
obtained in accordance with the user information 14. After the proper identification of the different major aspects of an 
audiovisual system the present inventors then realized that information is needed to describe the informational content 
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of each portion of the audiovisual system 1 6. 

[0094] With three portions of the audiovisual presentation system 16 identified, the functionality of each portion is 
identified together with its interrelationship to the other portions. To define the necessary interrelationships, a set of 
description schemes containing data describing each portion is defined. The description schemes include data that is 

5 auxiliary to the programs 1 0, the system 12, and the user 14, to store a set of information, ranging from human readable 
text to encoded data, that can be used in enabling browsing, filtering, searching, archiving, and personalization. By 
providing a separate description scheme describing the program(s) 10, the user 14, and the system 12, the three 
portions (program, user, and system) may be combined together to provide an interactivity not previously achievable. 
In addition, different programs 10, different users 14, and different systems 12 may be combined together in any com- 

io bination, while still maintaining full compatibility and functionality. It is to be understood that the description scheme 
may contain the data itself or include links to the data, as desired. 

[0095] A program description scheme 18 related to the video, still image, and/or audio information 10 preferably 
includes two sets of information, namely, program views and program profiles. The program views define logical struc- 
tures of the frames of a video that define how the video frames are potentially to be viewed suitable for efficient browsing. 
is For example the program views may contain a set of fields that contain data for the identification of keyframes, segment 
definitions between shots, highlight definitions, video summary definitions, different lengths of highlights, thumbnail 
set of frames, individual shots or scenes, representative frame of the video, grouping of different events, and a close- 
up view. The program view descriptions may contain thumbnail, siide, keyframe, highlights, and close-up views so 
that users can filter and search not only at the program level but also within a particular program. The description 
20 scheme also enables users to access information in varying detail amounts by supporting, for example, a key frame 
view as a part of a program view providing multiple levels of summary ranging from coarse to fine. The program profiles 
define distinctive characteristics of the content of the program, such as actors, stars, rating, director, release date, time 
stamps, keyword identification, trigger profile, still profile, event profile, character profile, object profile, color profile, 
texture profile, shape profile, motion profile, and categories. The program profiles are especially suitable to facilitate 
25 filtering and searching of the audio and video information. The description scheme enables users to have the provision 
of discovering interesting programs that they may be unaware of by providing a user description scheme. The user 
description scheme provides information to a software agent that in turn performs a search and filtering on behalf of 
the user by possibly using the system description scheme and the program description scheme information. It is to be 
understood that in one of the embodiments of the invention merely the program description scheme is included. 
so [0096] Program views contained in the program description scheme are a feature that supports a functionality such 
as close-up view. In the close-up view, a certain image object, e.g., a famous basketball player such as Michael Jordan, 
can be viewed up close by playing back a close-up sequence that is separate from the original program. An alternative 
view can be incorporated in a straightforward manner. Character profile on the other hand may contain spatio-temporal 
position and size of a rectangular region around the character of interest. This region can be enlarged by the presen- 
ts tation engine, or the presentation engine may darken outside the region to focus the user's attention to the characters 
spanning a certain number of frames. Information within the program description scheme may contain data about the 
initial size or location of the region, movement of the region from one frame to another, and duration and terms of the 
number of frames featuring the region. The character profile also provides provision for including text annotation and 
audio annotation about the character as well as web page information, and any other suitable information. Such char- 
40 acter profiles may include the audio annotation which is separate from and in addition to the associated audio track of 
the video. 

[0097] The program description scheme may likewise contain similar information regarding audio (such as radio 
broadcasts) and images (such as analog or digital photographs or a frame of a video). 

[0098] The user description scheme 20 preferably includes the user's personal preferences, and information regard- 
45 ing the user's viewing history such as for example browsing history, filtering history, searching history, and device 
setting history. The user's personal preferences includes information regarding particular programs and categorizations 
of programs that the user prefers to view. The user description scheme may also include personal information about 
the particular user, such as demographic and geographic information, e.g. zip code and age. The explicit definition of 
the particular programs or attributes related thereto permits the system 1 6 to select those programs from the information 
50 contained within the available program description schemes 1 8 that may be of interest to the user. Frequently, the user 
does not desire to learn to program the device nor desire to explicitly program the device. In addition, the user description 
scheme 20 may not be sufficiently robust to include explicit definitions describing all desirable programs for a particular 
user. In such a case, the capability of the user description scheme 20 to adapt to the viewing habits of the user to 
accommodate different viewing characteristics not explicitly provided for or otherwise difficult to describe is useful. In 
55 such a case, the user description scheme 20 may be augmented or any technique can be used to compare the infor- 
mation contained in the user description scheme 20 to the available information contained in the program description 
scheme 1 8 to make selections. The user description scheme provides a technique for holding user preferences ranging 
from program categories to program views, as well as usage history. User description scheme information is persistent 
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but can be updated by the user or by an intelligent software agent on behalf of the user at any arbitrary time. It may 
also be disabled by the user, at any time, if the user decides to do so. In addition, the user description scheme is 
modular and portable so that users can carry or port it from one device to another, such as with a handheld electronic 
device or smart card or transported over a network connecting multiple devices. When user description scheme is 

5 standardized among different manufacturers or products, user preferences become portable. For example, a user can 
personalize the television receiver in a hotel room permitting users to access information they prefer at any time and 
anywhere. In a sense, the user description scheme is persistent and timeless based. In addition, selected information 
within the program description scheme may be encrypted since at least part of the information may be deemed to be 
private (e.g., demographics). A user description scheme may be associated with an audiovisual program broadcast 

10 and compared with a particular user's description scheme of the receiver to readily determine whether or not the 
program's intended audience profile matches that of the user. It is to be understood that in one of the embodiments of 
the invention merely the user description scheme is included. 

[0099] The system description scheme 22 preferably manages the individual programs and other data. The man- 
agement may include maintaining lists of programs, categories, channels, users, videos, audio, and images. The man- 

15 agement may include the capabilities of a device for providing the audio, video, and/or images. Such capabilities may 
include, for example, screen size, stereo, AC3, DTS, color, black/white, etc. The management may also include rela- 
tionships between any one or more of the user, the audio, and the images in relation to one or more of a program 
description scheme(s) and a user description scheme(s). In a similar manner the management may include relation- 
ships between one or more of the program description scheme(s) and user description scheme(s). It is to be understood 

20 that in one of the embodiments of the invention merely the system description scheme is included. 

[0100] The descriptors of the program description scheme and the user description scheme should overlap, at least 
partially, so that potential desirability of the program can be determined by comparing descriptors representative of the 
same information. For example, the program and user description scheme may include the same set of categories and 
actors. The program description scheme has no knowledge of the user description scheme, and vice versa, so that 

25 each description scheme is not dependant on the other for its existence. It is not necessary for the description schemes 
to be fully populated. It is also beneficial not to include the program description scheme with the user description 
scheme because there will likely be thousands of programs with associated description schemes which if combined 
with the user description scheme would result in a unnecessarily large user description scheme. It is desirable to 
maintain the user description scheme small so that it is more readily portable. Accordingly, a system including only the 

30 program description scheme and the user description scheme would be beneficial. 

[0101] The user description scheme and the system description scheme should include at least partially overlapping 
fields. With overlapping fields the system can capture the desired information, which would otherwise not be recognized 
as desirable. The system description scheme preferably includes a list of users and available programs. Based on the 
master list of available programs, and associated program description scheme, the system can match the desired 

35 programs. It is also beneficial not to include the system description scheme with the user description scheme because 
there will likely be thousands of programs stored in the system description schemes which if combined with the user 
description scheme would result in a unnecessarily large user description scheme. It is desirable to maintain the user 
description scheme small so that it is more readily portable. For example, the user description scheme may include 
radio station preselected frequencies and/or types of stations, while the system description scheme includes the avail- 

40 able stations for radio stations in particular cities. When traveling to a different city the user description scheme together 
with the system description scheme will permit reprogramming the radio stations. Accordingly, a system including only 
the system description scheme and the user description scheme would be beneficial. 

[0102] The program description scheme and the system description scheme should include at least partially over- 
lapping fields. With the overlapping fields, the system description scheme will be capable of storing the information 

45 contained within the program description scheme, so that the information is properly indexed. With proper indexing, 
the system is capable of matching such information with the user information, if available, for obtaining and recording 
suitable programs. If the program description scheme and the system description scheme were not overlapping then 
no information would be extracted from the programs and stored. System capabilities specified within the system 
description scheme of a particular viewing system can be correlated with a program description scheme to determine 

50 the views that can be supported by the viewing system. For instance, if the viewing device is not capable of playing 
back video, its system description scheme may describe its viewing capabilities as limited to keyframe view and slide 
view only. Program description scheme of a particular program and system description scheme of the viewing system 
are utilized to present the appropriate views to the viewing system. Thus, a server of programs serves the appropriate 
views according to a particular viewing system's capabilities, which may be communicated over a network or commu- 

55 nication channel connecting the server with user's viewing device. It is preferred to maintain the program description 
scheme separate from the system description scheme because the content providers repackage the content and de- 
scription schemes in different styles, times, and formats. Preferably, the program description scheme is associated 
with the program, even if displayed at a different time. Accordingly, a system including only the system description 
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scheme and the program description scheme would be beneficial. 

[0103] By preferably maintaining the independence of each of the three description schemes while having fields that 
correlate the same information, the programs 10, the users 14, and the system 12 may be interchanged with one 
another while maintaining the functionality of the entire system 16. Referring to FIG. 2, the audio, visual, or audiovisual 

5 program 38, is received by the system 1 6. The program 38 may originate at any suitable source, such as for example 
broadcast television, cable television, satellite television, digital television, Internet broadcasts, world wide web, digital 
video discs, still images, video cameras, laser discs, magnetic media, computer hard drive, video tape, audio tape, 
data services, radio broadcasts, and microwave communications. The program description stream may originate from 
any suitable source, such as for example PSIP/DVB-SI information in digital television broadcasts, specialized digital 

10 television data services, specialized Internet services, world wide web, data files, data over the telephone, and memory, 
such as computer memory. The program, user, and/or system description scheme may be transported over a network 
(communication channel). For example, the system description scheme may be transported to the source to provide 
the source with views or other capabilities that the device is capable of using. In response, the source provides the 
device with image, audio, and/or video content customized or otherwise suitable for the particular device. The system 

75 16 may include any device(s) suitable to receive any one or more of such programs 38. An audiovisual program analysis 
module 42 performs an analysis of the received programs 38 to extract and provide program related information (de- 
scriptors) to the description scheme (DS) generation module 44. The program related information may be extracted 
from the data stream including the program 38 or obtained from any other source, such as for example data transferred 
over a telephone line, data already transferred to the system 1 6 in the past, or data from an associated file. The program 

20 related information preferably includes data defining both the program views and the program profiles available for the 
particular program 38. The analysis module 42 performs an analysis of the programs 38 using information obtained 
from (i) automatic audio-video analysis methods on the basis of low-level features that are extracted from the program 
(s), (ii) event detection techniques, (iii) data that is available (or extractable) from data sources or electronic program 
guides (EPGs, DVB-SI, and PS IP), and (iv) user information obtained from the user description scheme 20 to provide 

25 data defining the program description scheme. 

[0104] The selection of a particular program analysis technique depends on the amount of readily available data and 
the user preferences. For example, if a user prefers to watch a 5 minute video highlight of a particular program, such 
as a basketball game, the analysis module 42 may invoke a knowledge based system 90 (FIG. 3) to determine the 
highlights that form the best 5 minute summary. The knowledge based system 90 may invoke a commercial filter 92 

30 to remove commercials and a slow motion detector 54 to assist in creating the video summary. The analysis module 
42 may also invoke other modules to bring information together (e.g., textual information) to author particular program 
views. For example, if the program 38 is a home video where there is no further information available then the analysis 
module 42 may create a key-frame summary by identifying key-frames of a multi-level summary and passing the 
information to be used to generate the program views, and in particular a key frame view, to the description scheme. 

35 Referring also to FIG. 3, the analysis module 42 may also include other sub-modules, such as for example, a de-mux/ 
decoder 60, a data and service content analyzer 62, a text processing and text summary generator 64, a close caption 
analyzer 66, a title frame generator 68, an analysis manager 70, an audiovisual analysis and feature extractor 72, an 
event detector 74, a key-frame summarizer 76, and a highlight summarizer 78. 

[0105] The generation module 44 receives the system information 46 for the system description scheme. The system 

40 information 46 preferably includes data for the system description scheme 22 generated by the generation module 44. 
The generation module 44 also receives user information 48 including data for the user description scheme. The user 
information 48 preferably includes data for the user description scheme generated within the generation module 44. 
The user input 48 may include, for example, meta information to be included in the program and system description 
scheme. The user description scheme (or corresponding information) is provided to the analysis module 42 for selective 

45 analysis of the program(s) 38. For example, the user description scheme may be suitable for triggering the highlight 
generation functionality for a particular program and thus generating the preferred views and storing associated data 
in the program description scheme. The generation module 44 and the analysis module 42 provide data to a data 
storage unit 50. The storage unit 50 may be any storage device, such as memory or magnetic media. 
[0106] A search, filtering, and browsing (SFB) module 52 implements the description scheme technique by parsing 

50 and extracting information contained within the description scheme. The SFB module 52 may perform filtering, search- 
ing, and browsing of the programs 38, on the basis of the information contained in the description schemes. An intelligent 
software agent is preferably included within the SFB module 52 that gathers and provides user specific information to 
the generation module 44 to be used in authoring and updating the user description scheme (through the generation 
module 44). In this manner, desirable content may be provided to the user though a display 80. The selections of the 

55 desired program(s) to be retrieved, stored, and/or viewed may be programmed, at least in part, through a graphical 
user interface 82. The graphical user interface may also include or be connected to a presentation engine for presenting 
the information to the user through the graphical user interface. 

[0107] The intelligent management and consumption of audiovisual information using the multi-part description 
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stream device provides a next-generation device suitable for the modern era of information overload. The device re- 
sponds to changing lifestyles of individuals and families, and allows everyone to obtain the information they desire 
anytime and anywhere they want. 

[0108] An example of the use of the device may be as follows. A user comes home from work late Friday evening 
5 being happy the work week is finally over. The user desires to catch up with the events of the world and then watch 
ABC's 20/20 show later that evening. It is now 9 PM and the 20/20 show will start in an hour at 10 PM. The user is 
interested in the sporting events of the week, and all the news about the Microsoft case with the Department of Justice. 
The user description scheme may include a profile indicating a desire that the particular user wants to obtain all available 
information regarding the Microsoft trial and selected sporting events for particular teams. In addition, the system 
10 description scheme and program description scheme provide information regarding the content of the available infor- 
mation that may selectively be obtained and recorded. The system, in an autonomous manner, periodically obtains 
and records the audiovisual information that may be of interest to the user during the past week based on the three 
description schemes. The device most likely has recorded more than one hour of audiovisual information so the infor- 
mation needs to be condensed in some manner. The user starts interacting with the system with a pointer or voice 
is commands to indicate a desire to view recorded sporting programs. On the display, the user is presented with a list of 
recorded sporting events including Basketball and Soccer. Apparently the user's favorite Football team did not play 
that week because it was not recorded. The user is interested in basketball games and indicates a desire to view 
games. A set of title frames is presented on the display that captures an important moment of each game. The user 
selects the Chicago Bulls game and indicates a desire to view a 5 minute highlight of the game. The system automat- 
ic ically generates highlights. The highlights may be generated by audio or video analysis, or the program description 
scheme includes data indicating the frames that are presented for a 5 minute highlight. The system may have also 
recorded web-based textual information regarding the particular Chicago-Bulls game which may be selected by the 
user for viewing. If desired, the summarized information may be recorded onto a storage device, such as a DVD with 
a label. The stored information may also include an index code so that it can be located at a later time. After viewing 
25 the sporting events the user may decide to read the news about the Microsoft trial. It is now 9:50 PM and the user is 
done viewing the news. In fact, the user has selected to delete all the recorded news items after viewing them. The 
user then remembers to do one last thing before 10 PM in the evening. The next day, the user desires to watch the 
VHS tape that he received from his brother that day, containing footage about his brother's new baby girl and his 
vacation to Peru last summer. The user wants to watch the whole 2-hour tape but he is anxious to see what the baby 
30 looks like and also the new stadium built in Lima, which was not there last time he visited Peru. The user plans to take 
a quick look at a visual summary of the tape, browse, and perhaps watch a few segments for a couple of minutes, 
before the user takes his daughter to her piano lesson at 10 AM the next morning. The user plugs in the tape into his 
VCR, that is connected to the system, and invokes the summarization functionality of the system to scan the tape and 
prepare a summary. The user can then view the summary the next morning to quickly discover the baby's looks, and 
35 playback segments between the key-frames of the summary to catch a glimpse of the crying baby. The system may 
also record the tape content onto the system hard drive (or storage device) so the video summary can be viewed 
quickly. It is now 10:10 PM, and it seems that the user is 10 minutes late for viewing 20/20. Fortunately, the system, 
based on the three description schemes, has already been recording 20/20 since 10 PM. Now the user can start 
watching the recorded portion of 20/20 as the recording of 20/20 proceeds. The user will be done viewing 20/20 at 11 : 
40 10 PM. 

[0109] The average consumer has an ever increasing number of multimedia devices, such as a home audio system, 
a car stereo, several home television sets, web browsers, etc. The user currently has to customize each of the devices 
for optimal viewing and/or listening preferences. By storing the user preferences on a removable storage device, such 
as a smart card, the user may insert the card including the user preferences into such media devices for automatic 

45 customization. This results in the desired programs being automatically recorded on the VCR, and setting of the radio 
stations for the car stereo and home audio system. In this manner the user only has to specify his preferences at most 
once, on a single device and subsequently, the descriptors are automatically uploaded into devices by the removable 
storage device. The user description scheme may also be loaded into other devices using a wired or wireless network 
connection, e.g. that of a home network. Alternatively, the system can store the user history and create entries in the 

so user description scheme based on trie's audio and video viewing habits. In this manner, the user would never need to 
program the viewing information to obtain desired information. I n a sense, the user descriptor scheme enables modeling 
of the user by providing a central storage for the user's listening, viewing, browsing preferences, and user's behavior. 
This enables devices to be quickly personalized, and enables other components, such as intelligent agents, to com- 
municate on the basis of a standardized description format, and to make smart inferences regarding the user's pref- 

55 erences. 

[0110] Many different realizations and applications can be readily derived from FIGS. 2 and 3 by appropriately or- 
ganizing and utilizing their different parts, or by adding peripherals and extensions as needed. In its most general form, 
FIG. 2 depicts an audiovisual searching, filtering, browsing, and/or recording appliance that is personalizable. The list 
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of more specific applications/implementations given below is not exhaustive but covers a range. 
[0111] The user description scheme is a major enabler for personalizable audiovisual appliances. If the structure 
(syntax and semantics) of the description schemes is known amongst multiple appliances, the user (user) can carry 
(or otherwise transfer) the information contained within his user description scheme from one appliance to another, 
5 perhaps via a smart card-where these appliances support smart card interface- in order to personalize them. Per- 
sonalization can range from device settings, such as display contrast and volume control, to settings of television 
channels, radio stations, web stations, web sites, geographic information, and demographic information such as age, 
zip code etc. Appliances that can be personalized may access content from different sou rces. They may be connected 
to the web, terrestrial or cable broadcast, etc., and they may also access multiple or different types of single media 
10 such as video, music, etc. 

[0112] For example, one can personalize the car stereo using a smart card plugged out of the home system and 
plugged into the car stereo system to be able to tune to favorite stations at certain times. As another example, one can 
also personalize television viewing, for example, by plugging the smart card into a remote control that in turn will 
autonomously command the television receiving system to present the user information about current and future pro- 
fs grams that fits the user's preferences. Different members of the household can instantly personalize the viewing ex- 
perience by inserting their own smart card into the family remote. In the absence of such a remote, this same type of 
personalization can be achieved by plugging in the smart card directly to the television system. The remote may likewise 
control audio systems. In another implementation, the television receiving system holds user description schemes for 
multiple users (users) in local storage and identify different users (or group of users) by using an appropriate input 
20 interface. For example an interface using user-voice identification technology. It is noted that in a networked system 
the user description scheme may be transported over the network. 

[01 1 3] The user description scheme is generated by direct user input, and by using a software that watches the user 
to determine his/her usage pattern and usage history. User description scheme can be updated in a dynamic fashion 
by the user or automatically. A well defined and structured description scheme design allows different devices to in- 

25 teroperate with each other. A modular design also provides portability. 

[0114] The description scheme adds new functionality to those of the current VCR. An advanced VCR system can 
learn from the user via direct input of preferences, or by watching the usage pattern and history of the user. The user 
description scheme holds user's preferences users and usage history. An intelligent agent can then consult with the 
user description scheme and obtain information that it needs for acting on behalf of the user. Through the intelligent 

30 agent, the system acts on behalf of the user to discover programs that fit the taste of the user, alert the user about 
such programs, and/or record them autonomously. An agent can also manage the storage in the system according to 
the user description scheme, i.e., prioritizing the deletion of programs (or alerting the user for transfer to a removable 
media), or determining their compression factor (which directly impacts their visual quality) according to user's prefer- 
ences and history. 

35 [0115] The program description scheme and the system description scheme work in collaboration with the user 
description scheme in achieving some tasks. In addition, the program description scheme and system description 
scheme in an advanced VCR or other system will enable the user to browse, search, and filter audiovisual programs. 
Browsing in the system offers capabilities that are well beyond fast forwarding and rewinding. For instance, the user 
can view a thumbnail view of different categories of programs stored in the system. The user then may choose frame 

40 view, shot view, key frame view, or highlight view, depending on their availability and user's preference. These views 
can be readily invoked using the relevant information in the program description scheme, especially in program views. 
The user at any time can start viewing the program either in parts, or in its entirety. 

[0116] In this application, the program description scheme may be readily available from many services such as: (i) 
from broadcast (carried by EPG defined as a part of ATSC-PSIP (ATSC-Program Service Integration Protocol) in USA 
4 5 or DVB-SI (Digital Video Broadcast-Service Information) in Europe); (ii) from specialized data services (in addition to 
PSIP/DVB-SI); (iii) from specialized web sites; (iv) from the media storage unit containing the audiovisual content (e. 
g., DVD); (v) from advanced cameras (discussed later), and/or may be generated (i.e., for programs that are being 
stored) by the analysis module 42 or by user input 48. 

[0117] Contents of digital still and video cameras can be stored and managed by a system that implements the 
50 description schemes, e.g., a system as shown in FIG. 2. Advanced cameras can store a program description scheme, 
for instance, in addition to the audiovisual content itself. The program description scheme can be generated either in 
part or in its entirety on the camera itself via an appropriate user input interface (e.g., speech, visual menu drive, etc.). 
Users can input to the camera the program description scheme information, especially those high-level (or semantic) 
information that may otherwise be difficult to automatically extract by the system. Some camera settings and parameters 
55 (e.g., date and time), as well as quantities computed in the camera (e.g., color histogram to be included in the color 
profile), can also be used in generating the program description scheme. Once the camera is connected, the system 
can browse the camera content, or transfer the camera content and its description scheme to the local storage for 
future use. It is also possible to update or add information to the description scheme generated in the camera. 
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[0118] The IEEE 1394 and Havi standard specifications enable this type of "audiovisual content" centric communi- 
cation among devices. The description scheme API's can be used in the context of Havi to browse and/or search the 
contents of a camera or a DVD which also contain a description scheme associated with their content, i.e., doing more 
than merely invoking the PLAY API to play back and linearly view the media. The description schemes may be used 
5 in archiving audiovisual programs in a database. The search engine uses the information contained in the program 
description scheme to retrieve programs on the basis of their content. The program description scheme can also be 
used in navigating through the contents of the database or the query results. The user description scheme can be used 
in prioritizing the results of the user query during presentation. It is possible of course to make the program description 
scheme more comprehensive depending on the nature of the particular application. 
10 [0119] The description scheme fulfills the user's desire to have applications that pay attention and are responsive 
to their viewing and usage habits, preferences, and personal demographics. The proposed user description scheme 
directly addresses this desire in its selection of fields and interrelationship to other description schemes. Because the 
description schemes are modular in nature, the user can port his user description scheme from one device to another 
in order to "personalize" the device. 
15 [01 20] The proposed description schemes can be incorporated into current products similar to those from TiVo and 
Replay TV in order to extend their entertainment informational value. In particular, the description scheme will enable 
audiovisual browsing and searching of programs and enable filtering within a particular program by supporting multiple 
program views such as the highlight view. In addition, the description scheme will handle programs coming from sources 
other than television broadcasts for which TiVo and Replay TV are not designed to handle. In addition, by standardi- 
se zation of TiVo and Replay TV type of devices, other products may be interconnected to such devices to extend their 
capabilities, such as devices supporting an MPEG 7 description. MPEG-7 is the Moving Pictures Experts Group - 7, 
acting to standardize descriptions and description schemes for audiovisual information. The device may also be ex- 
tended to be personalized by multiple users, as desired. 

[0121] Because the description scheme is defined, the intelligent software agents can communicate among them- 
25 selves to make intelligent inferences regarding the user's preferences. In addition, the development and upgrade of 
intelligent software agents for browsing and filtering applications can be simplified based on the standardized user 
description scheme. 

[0122] The description scheme is multi-modal in the following sense that it holds both high level (semantic) and low 
level features and/or descriptors. For example, the high and low level descriptors are actor name and motion model 
30 parameters, respectively. High level descriptors are easily readable by humans while low level descriptors are more 
easily read by machines and less understandable by humans. The program description scheme can be readily har- 
monized with existing EPG, PSIP, and DVB-SI information facilitating search and filtering of broadcast programs. Ex- 
isting services can be extended in the future by incorporating additional information using the compliant description 
scheme. 

35 [0123] For example, one case may include audiovisual programs that are prerecorded on a media such as a digital 
video disc where the digital video disc also contains a description scheme that has the same syntax and semantics of 
the description scheme that the FSB module uses. If the FSB module uses a different description scheme, a transcoder 
(converter) of the description scheme may be employed. The user may want to browse and view the content of the 
digital video disc. In this case, the user may not need to invoke the analysis module to author a program description. 

40 However, the user may want to invoke his or her user description scheme in filtering, searching and browsing the digital 
video disc content. Other sources of program information may likewise be used in the same manner. 
[01 24] It is to be understood that any of the techniques described herein with relation to video are equally applicable 
to images (such as still image or a frame of a video) and audio (such as radio). 

[0125] An example of an audiovisual interface is shown in FIGS. 4-12 which is suitable for the preferred audiovisual 
45 description scheme. Referring to FIG. 4, by selecting the thumbnail function as a function of category provides a display 
with a set of categories on the left hand side. Selecting a particular category, such as news, provides a set of thumbnail 
views of different programs that are currently available for viewing. In addition, the different programs may also include 
programs that will be available at a different time for viewing. The thumbnail views are short video segments that 
provide an indication of the content of the respective actual program that it corresponds with. Referring to FIG. 5, a 
50 thumbnail view of available programs in terms of channels may be displayed, if desired. Referring to FIG. 6, a text view 
of available programs in terms of channels may be displayed, if desired. Referring to FIG. 7, a frame view of particular 
programs may be displayed, if desired. A representative frame is displayed in the center of the display with a set of 
representative frames of different programs in the left hand column. The frequency of the number of frames may be 
selected, as desired. Also a set of frames are displayed on the lower portion of the display representative of different 
55 frames during the particular selected program. Referring to FIG. 8, a shot view of particular programs may be displayed, 
as desired. A representative frame of a shot is displayed in the center of the display with a set of representative frames 
of different programs in the left hand column. Also a set of shots are displayed on the lower portion of the display 
representative of different shots (segments of a program, typically sequential in nature) during the particular selected 
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program. Referring to FIG. 9, a key frame view of particular programs may be displayed, as desired. A representative 
frame is displayed in the center of the display with a set of representative frames of different programs in the left hand 
column. Also a set of key frame views are displayed on the lower portion of the display representative of different key 
frame portions during the particular selected program. The number of key frames in each key frame view can be 
5 adjusted by selecting the level. Referring to FIG. 1 0, a highlight view may likewise be displayed, as desired. Referring 
to FIG. 11, an event view may likewise be displayed, as desired. Referring to FIG. 12, a character/object view may 
likewise be displayed, as desired. 

[01 26] An example of the description schemes is shown below in XML. The description scheme may be implemented 
in any language and include any of the included descriptions (or more), as desired. 
10 [0127] The proposed program description scheme includes three major sections for describing a video program. The 
first section identifies the described program. The second section defines a number of views which may be useful in 
browsing applications. The third section defines a number of profiles which may be useful in filtering and search ap- 
plications. Therefore, the overall structure of the proposed description scheme is as follows: 

15 

<?XMLversion="1.0"t> 

<!DOCTYPE MPEG-7 SYSTEM "mpeg-7.dtd"> 
20 <ProgramIdentity> 

<ProgramID> ... </ProgramID> 
<ProgramName> ... </ProgramN ame> 
25 <SourceLocation> ... </SourccLocation> 

</ProgramIdentity> 
<ProgramViews> 
<ThumbnailView> ... </ThumbnaiIView> 
<SlideVicw> ... </SlideView> 
<FrameView> ... </FrameView> 
<ShotView> ... </ShotView> 
<Key Frame View> ... </KeyFrameView> 
<HighlightView> ... </HighlightView> 
<EvcntVicw> ... </EventView> 
<CIoseUpView> ... </CloseUpView> 
45 <AltemateView> ... </AlternateView> 



50 



30 



35 



40 



55 
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5 
10 
15 
20 
25 
30 

Program Identity 
35 [0128] 

• Program ID 

40 

<ProgramID> program- id </ProgramID> 
The descriptor <ProgramlD> contains a number or a string to identify a program. 

45 

• Program name 

50 

<ProgramName> program-name </ProgramName> 
The descriptor <ProgramName> specifies the name of a program. 

55 

• Source location 

12 
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</ProgramViews> 

<ProgramProfiles> 
<GeneralProfile> ... </GencralProfile> 
<CategoryProfiIO ... </CategoryProfile> 
<DateTimeProfile> ... </DateTimeProfiIe> 
<K.eywordProfile> ... </KeywordProfilO 
<TriggerProfile> ... </TriggerProf51e> 
<StillProfile> ... </StillProfile> 
<EventProfiIe> ... ^EventProfilO 
<CharacterProfile> ... </CharacterProfile> 
<ObjectProfile> ... </ObjectProfile> 
<ColorProfile> ... </Co!orProfile> 
<TextureProfile> ... <7TextureProfile> 
<ShapeProflIe> ... </ShapeProfi!e> 
<MotionProfile> ... </MotionProfile> 

</ProgramProfiles> 
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<SourceLocation> source-url </SourccLocation> 

5 

The descriptor <SourceLocation> specifies the location of a program in URL format. 
Program Views 
10 [0129] 

• Thumbnail view 



15 



20 



<ThumbnailView> 

<Image> thumbnail-image </Image> 
</ThumbnailView> 



The descriptor <ThumbnailView> specifies an image as the thumbnail representation of a program. 
25 • Slide view 



30 



<SlideView> frame-id ... </SlideView> 



The descriptor <SlideView> specifies a number of frames in a program which may be viewed as snapshots 
or in a slide show manner. 



35 • Frame view 



40 



<FrameView> start-frame-id end- frame-id </FrameView> 



The descriptor <FrameView> specifies the start and end frames of a program. This is the most basic view of 
a program and any program has a frame view. 



45 • Shot view 



50 



<ShotView> 

<Shot id=*"*> start-frame- id end-frame-id display- frame-id </Shot> 
<Shot id=""> start-frame-id end-frame-id display-frame-id </Shor> 



55 



</ShotView> 

The descriptor <ShotView> specifies a number of shots in a program. The <Shot> descriptor defines the start 
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and end frames of a shot. It may also specify a frame to represent the shot. 
Key-frame view 

<K.eyFrameView> 

<KeyFrames level=""> 
<Clip id=""> start-frame-id end-frame-id display-frame-id </CIip> 
<CIip id=""> start-frame-id end-frame-id display-frame-id </CUp> 



15 </KeyFrames> 

<KeyFrames level= m *> 
<CIip id= ,ft *> start-frame-id end-frame-id display-frame-id </CHp> 
20 <Clip id=""> start-frame-id cnd-frame-id display- frame-id </Clip> 



</K.eyFrarnes> 

25 

</KeyFrameView> 

30 The descriptor <KeyFrameView> specifies key frames in a program. The key frames may be organized in a 

hierarchical manner and the hierarchy is captured by the descriptor <KeyFrames> with a level attribute. The clips 
which are associated with each key frame are defined by the descriptor <Clip>. Here the display frame in each 
clip is the corresponding key frame. 

35 • Highlight view 



4Q <HighIightVicw> 

<Highlight length= Mn > 



45 



50 



55 
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<C3ip id="'*> start- frame- id end-frame-id display-frame- id </Clip> 
<Clip id=""> stan-frame-id end-frame-id display- frame-id </CUp> 

</Highlight> 

<Highlight length= ,w > 
<CIip id=""> start- frame-id end-frame-id display- frame-id </CIip> 
<Clip id=""> start-frame-id end-frame-id display-frame-id <JCY\p> 

</HighIight> 



</HighlightView> 

The descriptor <HighlightView> specifies clips to form highlights of a program. A program may have different 
versions of highlights which are tailored into various time length, The clips are grouped into each version of highlight 
which is specified by the descriptor <Highlight> with a length attribute. 

Event view 



<EventView> 

<Events name=""> 
<Clip id=""> start-frame-id end-frame-id display- frame-id </Clip» 
<Clip id= w, > start-frame-id end-frame-id display- frame-id </Clip> 

</Events> 
<Events name=""> 
<Clip id= M "> start-frame-id end-frame-id display-frame-id </Clip> 



<CIip id=""> start-frame- id end-frame-id display- frame- id </Clip> 
</Events> 
</EventView> 



15 



10 



15 



20 



25 
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The descriptor <EventView> specifies clips which are related to certain events in a program. The clips are 
grouped into the corresponding events which are specified by the descriptor <Event> with a name attribute. 

• Close-up view 



<CloseUpView> 

<Target name= m *> 
<CHp id=' M > start-frame-id end-frame-id display-frame-id </Clip> 
<Clip id- "*> start-frame-id end-frame-id display-frame-id </Clip> 

</Target> 

<Target name= n "> 
<Clip id= m *> start-frame-id end-frame-id display-frame-id </Clip> 
<Clip id=""> start-frame-id end-frame-id display-frame-id </Clip> 

</Target> 

<CloseUpView> 



The descriptor <CloseUpView> specifies clips which may be zoomed in to certain targets in a program. The 
clips are grouped into the corresponding targets which are specified by the descriptor <Target> with a name at- 
35 tribute. 

• Alternate view 



40 



45 



50 



<AlternateView> 

<AlternateSource id=""> source-url </AltemaleSource> 
<AltemateSource id= mi > source-url </AltemateSource> 

</AlternateView> 

The descriptor <AlternateView> specifies sources which may be shown as alternate views of a program. Each 
alternate view is specified by the descriptor <AlternateSource> with an id attribute. The locate of the source may 
be specified in URL format. 



55 
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10 



15 



<GeneralProfiIe> 

<Title> title-text </Title> 
<Abstract> abstract-text </Abstract> 
<Audio> voice-annotation </Audio> 



20 



25 



30 



35 



<Www> web-page-url </Www> 
<ClosedCaption> yes/no </ClosedCaption> 
<Language> language-name </Language> 
<Rating> rating </Rating> 
<Length> time </Length> 
<Authors> author-name ... </Authors> 
<Producers> producer-name ... </Producers> 
<Directors> director-name ... </Directors> 
<Actors> actor-name ... </Actors> 



</GeneralProfile> 

40 

The descriptor <GeneralProfile> describes the general aspects of a program. 

• Category profile 

45 

<CategoryProfile> category-name ... </CategoryProfile> 

50 The descriptor <CategoryProfile> specifies the categories under which a program may be classified. 

• Date-time profile 

55 
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<DateTimeProfile> 

<ProductionDate> date </ProductionDate> 
<ReleaseDate> date </ReleaseDate> 



<RecordingDate> date </RecordingDate> 
<RecordingTime> time </RecordingTime> 

</DateTimeProfIle> 

The descriptor <DateTimeProfile> specifies various date and time information of a program. 
Keyword profile 

<KeywordProflIe> keyword ... </KeywordProfiIe> 

The descriptor <KeywordProfile> specifies a number of keywords which may be used to filter or search a 
program. 

Trigger profile 

<TriggerProfiIe> trigger-frame-id ... </TriggerProflle> 

The descriptor <TriggerProf ile> specifies a number of frames in a program which may be used to trigger certain 
actions while the playback of the program. 

Still profile 
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<StillProfile> 

5 <Still id=""> 

<HotRegion id = m *> 

<Location> xl yl x2 y2 </Location> 
10 <Text> text-annotation </Text> 

<Audio> voice-annotation </Audio> 
<Www> web-page-url </Www> 
</HotRegion> 
<HotRegion id =""> 

<Location> xl yl x2 y2 </Location> 
<Text> text-annotation </Text> 
<Audio> voice-annotation </Audio> 
<Www> web-page-url </Www> 
</HotRegion> 



15 



20 



25 



30 <SM> 

<Sti\\ id=""> 
<HotRegion id =""> 

35 <Location> xl yl x2 y2 </Location> 

<Text> text-annotation </Text> 
<Audio> voice-annotation </Audio> 
40 <Www> web-page-url <AVww> 

</HotRegion> 
<HotRegion id 

4 ^ <Location> x 1 y 1 x2 y2 </Location> 

<Text> text-annotation </Text> 
<Audio> voice-annotation </Audio> 

50 
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<Www> web-page-url <AVww> 
</HotRegion> 

</Sti\\> 
</SttIlProfile> 

The descriptor <StillProfile> specifies hot regions or regions of interest within a frame. The frame is specified 
by the descriptor <Still> with an id attribute which corresponds to the frame-id. Within a frame, each hot region is 
specified by the descriptor <HotRegion> with an id attribute. 

Event profile 

<EventProfiIe> 

<EventList> event-name ... </EventList> 
<Evcnt name=""> 
<Www> web-page-url <AVww> 
<OccuTTence id=*"*> 

<Duration> start-frame-id end-frame-id </Duration> 
<Text> text-annotation </Text> 
<Audio> voice-annotation </Audio> 
</Occurrence> 
<Occurrence id=""> 

<Duration> start-frame- id end-frame-id </Duration> 
<Text> text-annotation </Text> 
<Audio> voice-annotation </Audio> 
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</Occurrence> 



</Event> 
<Evcnt narne=""> 
<Www> web-page-url </Www> 
<Occurrence id=""> 

<Duration> start-frame-id end-frame-id </Duration> 

<Text> text-annotation </Text> 

<Audio> voice-annotation </Audio> 
</Occurrence> 
<Occurrence id =mi > 

<Duration> start-frame-id end-frame-id </Duration> 

<Text> text-annotation </Text> 

<Audio> voice-annotation </Audio> 
</Occurrence> 



</Event> 



</EventProfile> 



The descriptor <EventProfile> specifies the detailed information for certain events in a program. Each event 
is specified by the descriptor <Event> with a name attribute. Each occurrence of an event is specified by the 
descriptor <Occurrence> with an id attribute which may be matched with a clip id under <EventView>. 

Character profile 
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<CharacterProfile> 

<CharacterList> character-name ... </CharacterList> 
<Character name^"*^ 
<ActorName> actor-name </ActorName> 
<Gender> male </Gender> 
<Age> age </Age> 
<Www> web-page-ur! </Www> 
Occurrence id=' M *> 

<Duration> start-frame-id end-frame-id </Duration> 
<Location> frame: [x I yl x2 yl] ... </Location> 
<Motion> v x v y v x v n v p v y </Motion> 
<Text> text-annotation </Text> 
<Audio> voice-annotation </Audio> 
</OccurTence> 
<Occurrence id=="**> 

<Duration> start- frame-id end-framc-id </Duration> 
<Location> frame: [x I yl x2 yl] ... </Location> 
<Motion> v y v a v 0 v p v f </Motion> 
<Text> text-annotation </Text> 
<Audio> voice-annotation </Audio> 
</Occurrence> 

</Character> 
<Character namc=""> 

<ActorName> actor-name </ActorName> 

<Gender> male </Gender> 

<Age> age </Age> 
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<Www> web-page-url <AVww> 
<Occurrence id~ M "> 

<Duration> start-frame-id end-frame-id </Duration> 

<Location> frame:[xl yl x2 y2] ... </Location> 

<Motion> v, v y v 2 v 0 v p v T </Motion> 

<Text> text-annotation </Text> 

<Audio> voice-annotation </Audio> 
<yOccurrence> 
<Cccurrence id= ,M> > 

<Duration> start-frame-id end- frame-id </Duration> 

<Location> frame:[xl yl x2 y2] ... </Location> 

<Motion> v a \ y v t v a v p v Y </Motion> 

<Text> text-annotation </Text> 

<Audio> voice-annotation </Audio> 
</Occurrence> 

</Character> 

</CharacterProfile> 

The descriptor <Character Prof i le> specif ies the detailed information for certain characters in a program. Each 
character is specified by the descriptor <Character> with a name attribute. Each occurrence of a character is 
specified by the descriptor <Occurrence> with an id attribute which may be matched with a clip id under <Close- 
UpView>. 

Object profile 
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10 



<ObjectProfile> 

<ObjcctList> object-name ... </ObjcctList> 
<Object name=""> 
<Www> wcb-page-url </Www> 
<Occurrence id~""> 

<Duration> start-frame-id end-frame-id </Duran"on> 
<Location> frame:[xl yl x2 y2] ... </Location> 
1 5 <Motion> v x v y v x v a v^ v y <Motion> 

<Text> text-annotation </Text> 
<Audio> voice-annotation </Audio> 
</Occurrence> 
<Occurrence rd=*"> 

<Duratiori> start-frame- id end-frame-id </Duration> 
<Location> frame: [x I yl x2 y2] ... </Location> 
<Motion> v x v y v x v a v p v y </Motion> 



20 



25 
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35 



<Text> text-annotation </Text> 
<Audio> voice-annotation </Audio> 
</Occurrence> 



</Object> 
<Object name=""> 

40 <Www> web-page-url </Www> 

<Occurrence id='"*> 

<Duration> start-frame-id end-frame- id </Duration> 

45 <Location> frame: [xl yl x2 y2] ... </Locarion> 

<Motion> v m v y v z v a v p v y </Motion> 
<Text> text-annotation </Text> 

so 
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<Audio> voice-annotation </Audio> 
</Occurrence> 
<OccurTence id^*^ 

10 <Duration> start-frame-id end-frame-id </Duration> 

<Location> frame:[xl yl x2 y2] ... </Location> 
<Motion> v x v y v, v 0 v p v y </Motion> 

75 <Text> text-annotation </Text> 

<Audio> voice-annotation </Audio> 
</Occurrence> 



20 



25 



45 



55 



</Object> 



</ObjectProflIe> 

The descriptor <ObjectProfile> specifies the detailed information for certain objects in a program. Each object 
30 js specified by the descriptor <Object> with a name attribute. Each occurrence of a object is specified by the 

descriptor <Occurrence> with an id attribute which may be matched with a clip id under <CloseUpView>. 

• Color profile 

35 

<ColorProfile> 



40 </CoIorProfiIe> 



The descriptor <ColorProfile> specifies the detailed color information of a program. All MPEG-7 color descrip- 
tors may be placed under here. 



Texture profile 



50 <TextureProfile> 



</TextureProfile> 



The descriptor <TextureProfile> specifies the detailed texture information of a program. All MPEG-7 texture 
descriptors may be placed under here. 
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• Shape profile 

5 <ShapeProfile> 

</ShapeProfile> 

10 

The descriptor <ShapeProfile> specifies the detailed shape information of a program. All MPEG-7 shape de- 
scriptors may be placed under here. 



15 m Motion profile 



<MotionProfilc> 

20 

</MotionProfiie> 

25 

The descriptor <MotionProfile> specifies the detailed motion information of a program. All MPEG-7 motion 
descriptors may be placed under here. 

30 User Description Scheme 

[0131] The proposed user description scheme includes three major sections for describing a user. The first section 
identifies the described user. The second section records a number of settings which may be preferred by the user. 
The third section records some statistics which may reflect certain usage patterns of the user. Therefore, the overall 
35 structure of the proposed description scheme is as follows: 
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<?XML version-" UT> 

<!DOCTYPE MPEG-7 SYSTEM "mpeg-7.dtd"> 

<UserIdenrity> 
<UserID> ... </UserID> 
<UserName> ... </UserName> 

<AJserldentity> 

<UserPrcfcrences> 
<BrowsingPrcfcrences> ... </BrowsingPrefcrences> 
<FilteringPreferences> ... </FilteringPreferences> 
<SearchPreferences> ... </SearchPreferences> 
<DevicePreferences> ... </DcvicePreferences> 

</UserPreferences> 

<UserHistory> 
<BrowsingHistory> ... </BrowsingHistory> 

' <FilteringHistory> ... </FilteringHistory> 
<SearchHistory> ... </SearchHistory> 



<DeviceHistory> ... </DcviccHistory> 
</UserHistory> 
<UserDemographics> 

<Age> ... </Age> 

<Gender> ... </Gender> 

<ZIP> ... </ZIP> 
</UserDcmographics> 

User Identity 

[0132] 

• User ID 
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<UserID> user-id </UserlD> 

The descriptor <UserED> contains a number or a string to identify a user. 

• User name 

<UserName> user-name </UserName> 
The descriptor <UserName> specifies the name of a user. 
User Preferences 
[0133] 

• Browsing preferences 

<BrowsingPreferences> 
<Views> 

<ViewCategory id =m > view-id ... </ViewCategory> 
<ViewCategory id= m *> view-id ... </ViewCategory> 

</Views> 

<FrameFrequency> frequency ...<FrameFrequency> 
<ShotFrequency> frequency ...<ShotFrequency> 
<KeyFrameLeveI> level-id ...<KeyFrameLeveI> 
<HighIightLength> length ...<HighlightLength> 

</BrowsingPreferences> 

The descriptor <BrowsingPreferences> specifies the browsing preferences of a user. The user's preferred 
views are specified by the descriptor <Views>. For each category, the preferred views are specified by the de- 
scriptor <ViewCategory> with an id attribute which corresponds to the category id. The descriptor < Frame Frequ en - 
cy> specifies at what interval the frames should be displayed on a browsing slider under the frame view. The 
descriptor <ShotFrequency> specifies at what interval the shots should be displayed on a browsing slider under 
the shot view. The descriptor <KeyFrameLevel> specifies at what level the key frames should be displayed on a 
browsing slider under the key frame view. The descriptor <Highlightl_ength> specifies which version of the highlight 
should be shown under the highlight view. 

• Filtering preferences 
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<Fi!teringPreferences> 

5 

<C&tegories> category-name ... -CCatcgorie^ 
<Channels> channel-number ... </Channels> 
<Ratings> rating-id ... </Ratings> 

10 

<Shows> show-name ... </Shows> 
<Authors> author-name ... </Authors> 
<Producers> producer-name ... </Producers> 

15 

<Directors> director-name ... </Directors> 
<Actors> actor-name ... </Actors> 
20 <FCeywords> keyword ... </Keywords> 

<TitIes> title-text ... </Titles> 

25 </FiIteringPreferences> 

The descriptor <FilteringPreferences> specifies the filtering related preferences of a user 
• Search preferences 

30 

<SearchPreferences> 

35 



40 



45 



50 



55 



29 



BNSDOCID: <EP 1 1 89437 A2_l_> 



EP1 189 437 A2 



<Categories> category-name ... </Categories> 
5 <Channe!s> channel-number ... <Channels> 

<Ratings> rating-id ... </Ratings> 

<Shows> show-name ... </Shows> 
10 <Authors> author-name ... </Authors> 

<Producers> producer-name ... </Producers> 

<Directors> director-name ... </Directors> 
b <Actors> actor-name ... </Actors> 

<Keywords> keyword ... </Keywords> 

<Titles> title-text ... </TitIes> 

20 

</SearchPreferences> 

25 

The descriptor <SearchPreferences> specifies the search related preferences of a user. 

• Device preferences 

30 

<DevicePreferences> 

<Brightaess> brightness- value </Brightness> 
<Contrast> contrast-value </Contrast> 

35 

<Volume> volume-value <A r olume> 
</DevicePreferences> 

40 

The descriptor <DevicePreferences> specifies the device preferences of a user. 
Usage History 
45 [0134] 

• Browsing history 

50 
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10 



15 



20 



<B rowsingHistory> 
<Views> 

<ViewCategory id= m ^ view-id ... </ViewCategory> 
<ViewCategory id= H "> view-id ... </ViewCategory> 

</Views> 

<FrameFrcquency> frequency ...<FrameFrequency> 
<ShotFrequency> frequency ...<ShotFrequency> 
<KeyFrameLevel> level-id ...<KeyFrameLevel> 
<HighlightLength> length ...<HighlightLength> 



</BrowsingHistory> 

25 The descriptor <BrowsingHistory> captures the history of a user's browsing related activities. 

• Filtering history 

30 

<FilteringHistory> 

<Categories> category-name ... </Categories> 
35 <Channe1s> channel-number ... </Channels> 

<Ratings> rating-id ... </Ratings> 
<Shows> show-name ... </Shows> 

40 



45 <Authors> author-name ... </Authors> 

<Producers> producer-name ... </Producers> 
<Directors> director-name ... </Directors> 

50 <Actors> actor-name ... </Actors> 

<Keywords> keyword ... <tfCeywords> 
<Titles> title-text ... </Titles> 



55 



</FiIteringHistory> 



31 

BNSDOCID: <EP 1 189437A2_I_> 



EP 1 189 437 A2 

The descriptor <FilteringHistory> captures the history of a user's filtering related activities. 
Search history 



10 



15 



20 



25 



<SearchHistory> 

<Categories> category-name ... </Categories> 
<Channels> channel-number ... </Channels> 
<Ratings> rating-id ... </Ratings> 
<Shows> show-name ... </Shows> 
<Authors> authoi-name ... </Authors> 
<Producers> producer-name ... </Producers> 
<Directors> director-name ... </Directors> 
<Actors> actor-name ... </Actors> 
<Keywords> keyword ... </Keywords> 
<Titles> title-text ... </Titles> 



</SearchHistory> 

30 

The descriptor <SearchHistory> captures the history of a user's search related activities. 

• Device history 

35 

<DeviceHistory> 

<Brightness> brightness- value ... </Brightness> 

40 

<Contrast> contrast-value ... </Contrast> 
<Volume> volume-value ... </VoIume> 
^ </DeviceHistory> 

The descriptor <DeviceHistory> captures the history of a user's device related activities. 
50 User demographics 
[0135] 

• Age 

55 
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<Agc> age </Age> 

5 

The descriptor <Age> specifies the age of a user. 

• Gender 

10 

<Gender> ... </Gendcr> 

15 

The descriptor <Gender> specifies the gender of a user. 

• ZIP code 

20 

<2IP> ... </2IP> 

25 

The descriptor <ZIP> specifies the ZIP code of where a user lives. 
System Description Scheme 

30 

[01 36] The proposed system description scheme includes four major sections for describing a user. The first section 
identifies the described system. The second section keeps a list of all known users. The third section keeps lists of 
available programs. The fourth section describes the capabilities of the system. Therefore, the overall structure of the 
proposed description scheme is as follows: 

35 



40 



45 



50 



55 



33 



BNSDOCID: <EP 1 1 89437 A2_l_> 



EP 1 189 437 A2 



<?XML version="1.0"> 

<!DOCTYPE MPEG-7 SYSTEM "mpeg-7.dtd"> 
<SystemIdentity> 

<SystemID> ... </SystemlD> 

<SystemName> ... </SystemName> 

<SystemSerialNumber> ... </SysternSerialNumber> 
</SystemIdentity> 
<SystemUsers> 

<Users> ... </Users> 
</SystemUsers> 
<SystemPrograms> 

<Categories> ... </Categories> 

<Channels> ... </Channels> 

<Programs> ... </Programs> 
</SystemPrograms> 
<S ystemCapab il itics> 

<Views> ... </Views> 
</SystemCapabilities> 

System Identity 
[0137] 

• System ID 

<SystemID> system-id </SystemID> 

The descriptor <SystemlD> contains a number or a string to identify a video system 

• System name 

<SystemName> system-name </SystemName> 
The descriptor <SystemName> specifies the name of a video system or device. 

34 
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System serial number 
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<SystcmSerialNumbcr> systcm-scrial-n umber </SystcmScria!Numbcr> 



The descriptor <SystemSerialNumber> specifies the serial number of a video system or device. 

10 

System Users 
[0138] 
15 • Users 



<Users> 

20 

<User> 
<UserID> user-id </UserID> 
25 <UserName> user-name </UserName> 

</User> 
<Uscr> 

30 <UserID> user-id </UserID> 

<UserName> user-name </UserName> 
</User> 

35 

</Users> 

40 The descriptor <SystemUsers> lists a number of users who have registered on a video system or device. Each 

user is specified by the descriptor <User>. The descriptor <UserlD> specifies a number or a string which should 
match with the number or string specified in <UserlD> in one of the user description schemes. 

Programs in the System 

45 

[0139] 

• Categories 

50 
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<Categories> 

<Category> 

<CategoryID> category-id </CategoryID> 

<CategoryName> category-name </CategoryName> 

<SubCategories> sub-category-id ... </SubCategories> 
</Category> 

<Category> 
<CategoryID> category-id </CategoryID> 
<CategoryName> category-name </CategoryName> 
<SubCategories> sub-category-id ... </SubCategories> 

</Category> 

</Categories> 

The descriptor <Categories> lists a number of categories which have been registered on a video system or 
device. Each category is specified by the descriptor <Category>. The major-sub relationship between categories 
is captured by the descriptor < SubCategories>. 

Channels 

<ChanneIs> 

<Channel> 

<ChannclID> channel-id </ChannelID> 

<ChannelName> channel-name </ChanneIName> 

<SubChannel$> sub-channel-id ... </SubChannc!s> 
</Channel> 
<Channel> 

<ChannelID> channel-id </ChannelID> 

<ChannelName> channel-name </ChannelName> 

<SubChannels> sub-channel-id ... </SubChannels> 
</Channel> 
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</ChanneIs> 



5 



The descriptor <Channels> lists a number of channels which have been registered on a video system or 
device. Each channel is specified by the descriptor <Channel>. The major-sub relationship between channels is 
captured by the descriptor < SubChannels>. 



10 



Programs 



15 



<Programs> 



<CategoryPrograms> 
<CategoryID> category-id </CategoryID> 
<Programs> program-id ... </Programs> 

</CategoryPrograms> 

<CategoryPrograms> 



<CategoryID> category-id </CategoryrD> 
<Programs> program-id ... </Programs> 
</CatcgoryPrograms> 

<ChannelPrograrns> 
<ChanneIID> channel-id </ChanneIID> 
<Programs> program-id ... </Programs> 

</ChannelPrograms> 



<ChannelID> channel-id </ChannelID> 
<Programs> program-id ... </Programs> 
</ChannelPrograms> 



25 



40 



<ChannelPrograms> 



50 



</Programs> 



55 



The descriptor <Programs> lists programs who are available on a video system or device. The programs are 
grouped under corresponding categories or channels. Each group of programs are specified by the descriptor 
<CategoryPrograms> or <ChannelPrograms>. Each program id contained in the descriptor <Programs> should 
match with the number or string specified in <ProgramlD> in one of the program description schemes. 
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System Capabilities 
[0140] 
5 • Views 



10 



15 
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<Views> 

<View> 
<ViewID> view-id </ViewID> 
<ViewName> view-name </ViewName> 

</View> 

<View> 
<ViewID> view-id </ViewID> 
<ViewName> view-name </ViewName> 

</View> 



25 



</Views> 



The descriptor <Views> lists views which are supported by a video system or device. Each view is specified 
so by the descriptor <View>. The descriptor <ViewName> contains a string which should match with one of the fol- 

lowing views used in the program description schemes: Thumbnail View, SlideView, FrameView, ShotView, Key- 
FrameView, HighlightView, EventView, and CloseUpView. 

[0141] The present inventors came to the realization that the program description scheme may be further modified 

35 to provide additional capabilities. Referring to FIG. 13, the modified program description scheme 400 includes four 
separate types of information, namely, a syntactic structure description scheme 402, a semantic structure description 
scheme 404, a visualization description scheme 406, and a meta information description scheme 408. It is to be un- 
derstood that in any particular system one or more of the description schemes may be included, as desired. 
[0142] Referring to FIG. 14, the visualization description scheme 406 enables fast and effective browsing of video 

40 program (and audio programs) by allowing access to the necessary data, preferably in a one-step process. The visu- 
alization description scheme 406 provides for several different presentations of the video content (or audio), such as 
for example, a thumbnail view description scheme 410, a key frame view description scheme 412 : a highlight view 
description scheme 414, an event view description scheme 416, a close-up view description scheme 418, and an 
alternative view description scheme 420. Other presentation techniques and description schemes may be added, as 

45 desired. The thumbnail view description scheme 410 preferably includes an image 422 or reference to an image rep- 
resentative of the video content and a time reference 424 to the video. The key frame view description scheme 412 
preferably includes a level indicator 426 and a time reference 428. The level indicator 426 accommodates the pres- 
entation of a different number of key frames for the same video portion depending on the user's preference. The 
highlight view description scheme 414 includes a length indicator 430 and a time reference 432. The length indicator 

50 430 accommodates the presentation of a different highlight duration of a video depending on the user's preference. 
The event view description scheme 41 6 preferably includes an event indicator 434 for the selection of the desired event 
and a time reference 436. The close-up view description scheme 41 8 preferably includes a target indicator 438 and a 
time reference 440. The alternate view description scheme preferably includes a source indicator 442. To increase 
performance of the system it is preferred to specify the data which is needed to render such views in a centralized and 

55 straightforward manner. By doing so, it is then feasible to access the data in a simple one-step process without complex 
parsing of the video. 

[0143] Referring to FIG. 15, the meta information description scheme 408 generally includes various descriptors 
which carry general information about a video (or audio) program such as the title, category, keywords, etc. Additional 
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descriptors, such as those previously described, may be included, as desired. 

[0144] Referring again to FIG. 13, the syntactic structure description scheme 402 specifies the physical structure of 
a video program (or audio), e.g., a table of contents. The physical features, may include for example, color, texture, 
motion, etc. The syntactic structure description scheme 402 preferably includes three modules, namely a segment 

5 description scheme 450, a region description scheme 452, and a segment/region relation graph description scheme 
454. The segment description scheme 450 may be used to define relationships between different portions of the video 
consisting of multiple frames of the video. A segment description scheme 450 may contain another segment description 
scheme 450 and/or shot description scheme to form a segment tree. Such a segment tree may be used to define a 
temporal structure of a video program. Multiple segment trees may be created and thereby create multiple table of 

io contents. For example, a video program may be segmented into story units, scenes, and shots, from which the segment 
description scheme 450 may contain such information as a table of contents. The shot description scheme may contain 
a number of key frame description schemes, a mosaic description scheme(s), a camera motion description scheme 
(s), etc. The key frame description scheme may contain a still image description scheme which may in turn contains 
color and texture descriptors. It is noted that various low level descriptors may be included in the still image description 

15 scheme under the segment description scheme. Also, the visual descriptors may be included in the region description 
scheme which is not necessarily under a still image description scheme. On example of a segment description scheme 
450 is shown in FIG. 16. 

[0145] Referring to FIG. 17, the region description scheme 452 defines the interrelationships between groups of 
pixels of the same and/or different frames of the video . The region description scheme 452 may also contain geometrical 

20 features, color, texture features, motion features, etc. 

[01 46] Referring to FIG. 1 8, the segment/region relation graph description scheme 454 defines the interrelationships 
between a plurality of regions (or region description schemes), a plurality of segments (or segment description 
schemes), and/or a plurality of regions (or description schemes) and segments (or description schemes). 
[01 47] Referring again to FIG. 13, the semantic structure description scheme 404 is used to specify semantic features 

25 of a video program (or audio), e.g. semantic events. In a similar manner to the syntactic structure description scheme, 
the semantic structure description scheme 404 preferably includes three modules, namely an event description scheme 
480, an object description scheme 482, and an event/objection relation graph description scheme 484. The event 
description scheme 480 may be used to form relationships between different events of the video normally consisting 
of multiple frames of the video. An event description scheme 480 may contain another event description scheme 480 

30 to form a segment tree. Such an event segment tree may be used to define a semantic index table for a video program. 
Multiple event trees may be created and thereby creating multiple index tables. For example, a video program may 
include multiple events, such as a basketball dunk, a fast break, and a free throw, and the event description scheme 
may contain such information as an index table. The event description scheme may also contain references which link 
the event to the corresponding segments and/or regions specified in the syntactic structure description scheme. On 

35 example of an event description scheme is shown in FIG. 19. 

[0148] Referring to FIG. 20, the object description scheme 482 defines the interrelationships between groups of 
pixels of the same and/or different frames of the video representative of objects. The object description scheme 482 
may contain another object description scheme and thereby form an object tree. Such an object tree may be used to 
define an object index table for a video program. The object description scheme may also contain references which 

40 link the object to the corresponding segments and/or regions specified in the syntactic structure description scheme. . 
[0149] Referring to FIG. 21, the event/object relation graph description scheme 484 defines the interrelationships 
between a plurality of events (or event description schemes), a plurality of objects (or object description schemes), 
and/or a plurality of events (or description schemes) and objects (or description schemes). 

[0150] After further consideration, the present inventors came the realization that the particular design of the user 
45 preference description scheme is important to implement portability, while permitting adaptive updating, of the user 
preference description scheme. Moreover, the user preference description scheme should be readily usable by the 
system while likewise being suitable for modification based on the user's historical usage patterns. It is possible to 
collectively track all users of a particular device to build a database for the historical viewing preferences of the users 
of the device, and thereafter process the data dynamically to determine which content the users would likely desire. 
50 However, this implementation would require the storage of a large amount of data and the associated dynamic process- 
ing requirements to determine the user preferences. It is to be understood that the user preference description scheme 
may be used alone or in combination with other description scheme. 

[0151] Referring to FIG. 22, to achieve portability and potentially decreased processing requirements the user pref- 
erence description scheme 20 should be divided into at least two separate description schemes, namely, a usage 
55 preference description scheme 500 and a usage history description scheme 502. The usage preference description 
scheme 500, described in detail later, includes a description scheme of the user's audio and/or video consumption 
preferences. The usage preference description scheme 500 describes one or more of the following, depending on the 
particular implementation, (a) browsing preferences, (b) filtering preferences, (c) searching preferences, and (d) device 
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preferences of the user. The type of preferences shown in the usage preference description scheme 500 are generally 
immediately usable by the system for selecting and otherwise using the available audio and/or video content. In other 
words., the usage preference description scheme 500 includes data describing audio and/or video consumption of the 
user. The usage history description scheme 502, described in detail later, includes a description scheme of the user's 

5 historical audio and/or video activity, such as browsing, device settings, viewing, and selection. The usage history 
description scheme 502 describes one or more of the following, depending on the particular implementation, (a) brows- 
ing history, (b) filtering history, (c) searching history, (d) device usage history, and (e) the time of action of the imple- 
mentation. The type of preferences shown in the usage history description scheme 502 are not generally immediately 
usable by the system for selecting and otherwise using the available audio and/or video content. The data contained 

10 in the usage history description scheme 502 may be considered generally "unprocessed", at least in comparison to 
the data contained in the usage preferences description scheme 500 because it generally contains the historical usage 
data of the audio and/or video content of the viewer. 

[0152] In general, storing the user's usage history including facts that the user viewed and selected programs and 
browsing procedures thereof viewed, and utilizing a variety of algorithms, a machine may automatically prepare the 

15 user's preferences. Utlizing the user's history description scheme may update the user's preference description. As 
an example, taking statistics of such history information, user's preference information may be derived. 
[0153] Since history information and preference information are independently managed, the preference information 
may be updated as desired. Thus, a machine less capable of preparing user's preference information — e.g., a mobile 
terminal — may only store the history information, then transmit the history information to another machine more capable 

20 of preparing preference information so as to update the preference information. Further, a user's taste information may 
be derived from a plurality of history informations of the same user. 

[0154] Furthermore, since the machine contains the user's viewing history informations and user's preference infor- 
mations, when restarting the machine, the following program or content may be automatically provided to the user for 
viewing, and a new program may be recommended based on the preference information. 

25 [0155] After consideration of the usage preference description 500 and the usage history description 502, the present 
inventors came to the realization that in the home environment many different users with different viewing and usage 
preferences may use the same device. For example, with a male adult preferring sports, a female adult preferring 
afternoon talk shows, and a three year old child preferring children's programming, the total information contained in 
the usage preference description 500 and the usage history description 502 will not be individually suitable for any 

30 particular user. The resulting composite data and its usage by the device is frustrating to the users because the device 
will not properly select and present audio and/or video content that is tailored to any particular user. To alleviate this 
limitation, the user preference description 20 may also include a user identification (user identifier) description 504. 
The user identification description 504 includes an identification of the particular user that is using the device. By 
incorporating a user identification description 504 more than one user may use the device while maintaining a different 

35 or a unique set of data within the usage preference description 500 and the usage history description 502. Accordingly, 
the user identification description 504 associates the appropriate usage preference description(s) 500 and usage his- 
tory description(s) 502 for the particular user identified by the user identification description 504. With multiple user 
identification descriptions 504, multiple entries within a single user identification description 504 identifying different 
users, and/or including the user identification description within the usage preference description 500 and/or usage 

40 history description 502 to provide the association therebetween, multiple users can readily use the same device while 
maintaining their individuality. Also, without the user identification description in the preferences and/or history, the 
user may more readily customize content anonymously. In addition, the user's user identification description 504 may 
be used to identify multiple different sets of usage preference descriptions 500 - usage history descriptions 502, from 
which the user may select for present interaction with the device depending on usage conditions. The use of multiple 

45 user identification descriptions for the same user is useful when the user uses dultiple different types of devices, such 
as a television, a home stereo, a business television, a hotel television, and a vehicle audio player, and maintains 
multiple different sets of preference descriptions. Further, the identification may likewise be used to identify groups of 
individuals, such as for example, a family In addition, devices that are used on a temporary basis, such as those in 
hotel rooms or rental cars, the user identification requirements may be overridden by employing a temporary session 

so user identification assigned by such devices. In applications where privacy concerns may be resolved or are otherwise 
not a concern, the user identification description 504 may also contain demographic information of the user. In this 
manner, as the usage history description 502 increases during use over time, this demographic data and/or data re- 
garding usage patterns may be made available to other sources. The data may be used for any purpose, such as for 
example, providing targeted advertising or programming on the device based on such data. 

55 [01 56] Referring to FIG. 23, periodically an agent 51 0 processes the usage history description(s) 502 for a particular 
user to "automatically" determine the particular user's preferences. In this manner, the user's usage preference de- 
scription 500 is updated to reflect data stored in the usage history description 502. This processing by the agent 510 
is preferably performed on a periodic basis so that during normal operation the usage history description 502 does not 
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need to be processed, or otherwise queried, to determine the user's current browsing, filtering, searching, and device 
preferences. The usage preference description 500 is relatively compact and suitable for storage on a portable storage 
device, such as a smart card, for use by other devices as previously described. 

[0157] Frequently, the user may be traveling away from home with his smart card containing his usage preference 

5 description 500. During such traveling the user will likely be browsing, filtering, searching, and setting device prefer- 
ences of audio and/or video content on devices into which he provided his usage preference description 500. However, 
in some circumstances the audio and/or video content browsed, filtered, searched, and device preferences of the user 
may not be typically what he is normally interested in. In addition, for a single device the user may desire more than 
one profile depending on the season, such as football season, basketball season, baseball season, fall, winter, summer, 

10 and spring. Accordingly, it may not be appropriate for the device to create a usage history description 502 and thereafter 
have the agent 510 "automatically" update the user's usage preference description 500. This will in effect corrupt the 
user's usage preference description 500. Accordingly, the device should include an option that disables the agent 510 
from updating the usage preference description 500. Alternatively, the usage preference description 500 may include 
one or more fields or data structures that indicate whether or not the user desires the usage preference description 

*5 500 (or portions thereof) to be updated. 

[01 58] Referring to FIG. 24, the device may use the program descriptions provided by any suitable source describing 
the current and/or future audio and/or video content available from which a filtering agent 520 selects the appropriate 
content for the particular user(s). The content is selected based upon the usage preference description for a particular 
user idenlification(s) to determine a list of preferred audio and/or video programs. 

20 [0159] As it may be observed, with a relatively compact user preference description 500 the user's preferences are 
readily movable to different devices, such as a personal video recorder, a TiVO player, a RePlay Networks player, a 
car audio player, or other audio and/or video appliance. Yet, the user preference description 500 may be updated in 
accordance with the user's browsing, filtering, searching, and device preferences. 

[0160] Referring to FIG. 25, the usage preference description 500 preferably includes three different categories of 
25 descriptions, depending on the particular implementation. The preferred descriptions include (a) browsing preferences 
description 530, (b) filtering and search preferences description, 532 and (c) device preferences description 534. The 
browsing preferences description 530 relates to the viewing preferences of audio and/or video programs. The filtering 
and search preferences description 532 relates to audio and/or video program level preferences. The program level 
preferences are not necessarily used at the same time as the (browsing) viewing preferences. For example, preferred 
30 programs can be determined as a result of filtering program descriptions according to user's filtering preferences. A 
particular preferred program may subsequently be viewed in accordance with user's browsing preferences. Accordingly, 
efficient implementation may be achieved if the browsing preferences description 530 is separate, at least logically, 
from the filtering and search preferences description 532. The device preferences description 534 relates to the pref- 
erences for setting up the device in relation to the type of content being presented, e.g. romance, drama, action, 
35 violence, evening, morning, day, weekend, weekday, and/or the available presentation devices. For example, presen- 
tation devices may include stereo sound, mono sound, surround sound, multiple potential displays, multiple different 
sets of audio speakers, AC-3, and Dolby Digital. It may likewise be observed that the device preferences description 
534 is likewise separate, at least logically, from the browsing description 530 and filtering/search preferences descrip- 
tion 532. 

40 [0161] The browsing preferences description 530 contains descriptors that describe preferences of the user for 
browsing multimedia (audio and/or video) information. In the case of video, for example, the browsing preferences may 
include user's preference for continuous playback of the entire program versus visualizing a short summary of the 
program. Various summary types may be described in the program descriptions describing multiple different views of 
programs where these descriptions are utilized by the device to facilitate rapid non-linear browsing, viewing, and nav- 

45 igation. Parameters of the various summary types should also be specified, i.e., number of hierarchy levels when the 
keyframe summary is preferred, or the time duration of the video highlight when highlight summary is preferred. In 
addition, browsing preferences may also include descriptors describing parental control settings. A switch descriptor 
(set by the user) should also be included to specify whether or not the preferences can be modified without consulting 
the user first. This prevents inadvertent changing or updating of the preferences by the device. In addition, it is desirable 

50 that the browsing preferences are media content dependent. For example, a user may prefer 1 5 minute video highlight 
of a basketball game or may prefer to see only the 3-point shots. The same user may prefer a keyframe summary with 
two levels of hierarchy for home videos. 

[0162] The filtering and search preferences description 532 preferably has four descriptions defined therein, depend- 
ing on the particular embodiment. The keyword preferences description 540 is used to specify favorite topics that may 
55 not be captured in the title, category, etc., information. This permits the acceptance of a query for matching entries in 
any of the available data fields. The content preferences description 542 is used to facilitate capturing, for instance, 
favorite actors, directors. The creation preferences description 544 is used to specify capturing, for instance, titles of 
favorite shows. The classification preferences description 546 is used to specify descriptions, for instance, a favorite 
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program category. A switch descriptor, activated by the user, may be included to specify whether or not the preferences 
may be modified without consulting the user, as previously described. 

[0163] The device preferences description 534 contains descriptors describing preferred audio and/or video render- 
ing settings, such as volume, balance, bass, treble, brightness, contrast, closed captioning, AC-3, Dolby digital, which 

5 display device of several, type of display device, etc. The settings of the device relate to how the user browses and 
consumes the audio and/or video content. It is desirable to be able to specify the device setting preferences in a media 
type and content-dependent manner. For example the preferred volume settings for an action movie may be higher 
than a drama, or the preferred settings of bass for classical music and rock music may be different. A switch descriptor, 
activated by the user, may be included to specify whether or not the preferences may be modified without consulting 

10 the user, as previously described. 

[01 64] Referring to FIG. 26, the usage preferences description may be used in cooperation with an MPEG-7 compliant 
data stream and/or device. MPEG-7 descriptions are described in ISO/IEC JTC1/SC29/WG11 "MPEG-7 Media/Meta 
DSs (V0.2), August 1999, incorporated by reference herein. It is preferable that media content descriptions are con- 
sistent with descriptions of preferences of users consuming the media. Consistency can be achieved by using common 

15 descriptors in media and user preference descriptions or by specifying a correspondence between user preferences 
and media descriptors. Browsing preferences descriptions are preferably consistent with media descriptions describing 
different views and summaries of the media. The content preferences description 542 is preferably consistent with, e. 
g.,a subset of the content description of the media 552 specified in MPEG-7 by content description scheme. The 
classification preferences description 544 is preferably consistent with, e.g., a subset of the classification description 

20 554 defined in MPEG-7 as classification description scheme. The creation preferences description 546 is preferably 
consistent with, e.g., a subset of the creation description 556 specified in MPEG-7 by creation description scheme. 
The keyword preferences description 540 is preferably a string supporting multiple languages and consistent with 
corresponding media content description schemes. Consistency between media and user preference descriptions is 
depicted or shown in FIG. 26 by couble arrows in the case of content, creation, and classification preferences. 

25 [0165] Referring to FIG. 27, the usage history description 502 preferably includes three different categories of de- 
scriptions, depending on the particular implementation. The preferred descriptions include (a) browsing history de- 
scription 560, (b) filtering and search history description 562, and (c) device usage history description 564, as previously 
described in relation to the usage preference description 500. The filtering and search history description 562 preferably 
has four descriptions defined therein, depending on the particular embodiment, namely, a keyword usage history de- 

30 scription 566, a content usage history description 568, a creation preferences description 570, and a classification 
usage history description 572, as previously described with respect to the preferences. The usage history description 
502 may contain additional descriptors therein (or description if desired) that describe the time and/or time duration of 
information contained therein. The time refers to the duration of consuming a particular audio and/or video program. 
The duration of time that a particular program has been viewed provides information that may be used to determine 

35 user preferences. For example, if a user only watches a show for 5 minutes then it may not be a suitable preference 
for inclusion the usage preference description 500. In addition, the present inventors came to the realization that an 
even more accurate measure of the user's preference of a particular audio and/or video program is the time viewed in 
light of the total duration of the program. This accounts for the relative viewing duration of a program. For example 
watching 30 minutes of a 4 hour show may be of less relevance than watching 30 minutes of a 30 minute show to 

40 determine preference data for inclusion in the usage preference description 500. 

[0166] Referring to FIG. 28, an exemplary example of an audio and/or video program receiver with persistent storage 
601 is illustrated. As shown, audio/video program descriptions 600 are available from the broadcast or other source, 
such as a telephone line. The user preference 606 description facilitate personalization of the browsing 603, filtering 
and search 604, and device settings in the display 602. In this embodiment, the user preferences 606 are stored at 

45 the user's terminal with provision for transporting it to other systems, for example via a smart card 607. Alternatively, 
the user preferences 606 may be stored in a server and the content adaptation can be performed according to user 
descriptions at the server and then the preferred content is transmitted to the user. The user 61 1 may directly provide 
the user preferences 606, if desired. The user preferences 606 and/or user history 609 may likewise be provided to a 
service provider 61 0. The system may employ an application that records user's usage history 609 in the form of usage 

50 history description, as previously defined. The usage history description is then utilized by another application, e.g., a 
smart agent, to automatically (608) map usage history 609 to user preferences 606. 

Additional Attributes and Descriptors 

55 in The Description and The Description Scheme 

[01 67] The present inventors came to the realization that additional functionality for the system may be achieved by 
the incorporation of particular types of information in the descriptions and description schemes. A description scheme 
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is a data model of descriptions. It specifies the descriptors and their syntax as they are used in the description. In what 
follows, use the terms description and description scheme may be used interchangeably since they both correspond 
to describing media and user preferences. An explanation of the additional attributes and descriptors in the descriptions 
will be provided, followed by an example of portions of example descriptions. 

5 [0168] After further consideration, there is a need for many users to maintain multiple separate user preference 
descriptions. Multiple user preference descriptions may correspond to, for example, different locations (e.g., at home, 
at the office, away from home, stationary versus traveling in a vehicle), different situations, different times (e.g., different 
days, different seasons), different emotional states of the user (e.g., happy mood versus tired or sad), and/or persistence 
(e.g., temporary usage versus permanent usage). Further, the user preference descriptions may include differentiation 

10 for different terminals with different primary functionalities (e.g., a personal video recorder versus a cell phone). In 
addition, available communication channel bandwidth at different locations or situations may use different preferences. 
Also, the preference of a user for the length of an audiovisual summary of a video program for downloading may be 
different. The user in different usage conditions may use the user identification description scheme as a basis to dis- 
tinguish between different devices and/or services. An example of different conditions may include a television broad- 

15 cast receiver and a cellular telephone. 

[0169] In addition to maintaining multiple user preferences for a particular user based on the aforementioned con- 
ditions, the present inventors also came to the realization that the different locations, different situations, different 
emotional states, different seasons, and/or different terminals (etc.), may likewise be used as the basis for distinguishing 
between the user preference descriptions. 

20 [0170] One technique to permit a particular user to have multiple preference descriptions and distinguishing them 
from one another is by using different usernames or by using a versioning mechanism, such as a version descriptor 
in the identification description scheme, as described later. 

[0171] As previously described, the system may include multiple user preference descriptions for a particular user. 
With multiple descriptions, the system may express the different user preferences with different granularity, e.g., a 

25 greater or lesser amount of detail. The increased granularity (sparseness) may be merely the result of applying a filter 
to the user preference description that further reduces the amount of data. In other words, the structure of the usage 
preference description may be identical with the difference being the result of the filter further reducing the data. In 
another embodiment, the variable granularity results in a different size of the data contained in the user preferences, 
which may be based upon, if desired, the location and/or application of the user. User preferences with increased 

30 granularity may be especially suitable for storage on portable memory devices with limited memory capability. Likewise, 
the granularity may be applied to the usage history. 

[0172] Another aspect of the present invention permits the user preferences (and history) to be based upon the 
media type, media source, or content (e.g., music versus video, radio versus television broadcast, and/or sports video 
versus home video). These preferences relate to the audio and/or video itself, as opposed to a third party characteri- 

35 zation of the desirability of the multimedia. The inclusion of this information permits a reduction in the computational 
processing requirements depending on the media type, media source, and/or content of the media. 
[0173] Another feature that may be included in the system is a protection attribute for each, or a selected set of, 
component of the user descriptions. The protection attributes specifies the access right of a system or service provider, 
typically a party other than the user himself, to the user's descriptions or any component thereof. In one embodiment, 

40 the protection attributes may specify the user's desire to permit others access to such data. One technique to implement 
the protection attribute is to include a protection attribute as a primitive attribute that is contained by all relevant parts 
of the user description scheme. 

[0174] Descriptors and description schemes for browsing preferences may be aligned with particular types of mul- 
timedia summary description schemes that are contained in ISO/I EC JTC1/SC29/WG11 N3246, "MPEG-7 Generic AV 

45 Description Schemes, Working Draft v2.0", Noordwijkerhout, March 2000. This allows the user to specify the type of 
a particular visual summary of an audiovisual program, and the duration of a summary that is in the form of a visual 
highlight. However, after further consideration the present inventors have determined that specification of the preferred 
minimum and maximum amount of data permitted in an audiovisual summary significantly enhances the system ca- 
pability. Such a provision provides, for example, the capability of the user effectively browsing audiovisual summaries 

50 of content over channels with limited bandwidth and using terminals with different limitations. With a terminal connected 
to a bandwidth limited channel, the user may specify preference for a relatively short highlight of the program, while 
with a terminal that is connected to a higher bandwidth channel, the user may specify preference for a longer highlight 
of the program. Such a set of channels may be mobile channels and cable channels. In addition, for terminals that are 
not capable of displaying frames at a video rate, the user may prefer keyframe summaries consisting of a maximum 

55 number of keyframes appropriate for the communication channel bandwidth. To achieve these enhancements, the 
present inventors propose using descriptors in the browsing preferences description (and description scheme, or other 
preferences description) specifying the minimum, maximum, and exact number of keyframes, and minimum, maximum, 
and exact duration of audio and/or visual highlights. 
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[0175] As described, the description scheme is adaptable to express the preferred minimum and maximum amount 
of visual material to adapt to different viewing preferences as well as terminal and communication channel bandwidth 
limitations. This implementation may be achieved by the following descriptors included in the browsing preferences 
description scheme: MaxNumOfKeyframes, M in NumOf Keyframes, NumOf Keyframes, MaxSummaryDuration, Min- 

5 SummaryDuration, and Summary Duration. The MaxNumOfKeyframes and MinNumof Keyframes preference descrip- 
tors specify, respectively, the maximum and minimum number of keyframes in the keyframe-summary of a video pro- 
gram. NumOf Keyframes descriptor specifies the standard number of keyframes. Depending on the known bandwidth 
conditions of a known connection that the user uses regularly, he or she may specify these descriptors. The MaxSum- 
maryDuration and MinSummaryDuration descriptors specify, respectively, the maximum and minimum temporal dura- 

10 tion of an audiovisual highlight summary. SummaryDuration descriptor specifies the standard duration of highlight- 
summary. Again, depending on user's taste, terminal, and channel limitations, the user may specify these descriptors. 
The MaxSummaryDuration and MinSummaryDuration descriptors apply to preferences for audio signals as well as 
where audio highlights may have been generated by audio skimming methods. User's browsing preference descriptions 
may be correlated with media descriptions by a filtering agent 520 in Fig. 24 in order to determine media descriptions 

is that contain summary descriptions that match user's preference descriptions and provide the user the associated sum- 
marized media in the preferred type of summary. 

[0176] An additional descriptor that may be introduced is an abstraction fidelity descriptor for universal multimedia 
access application, where fidelity of a summary abstraction of a program is described. This can correspond to the 
variation fidelity descriptor defined in ISO/lEC JTC1/SC29 WG11 N3246, "MPEG-7 Multimedia Description Schemes, 

20 Working Draft v2.0", Noordwijkerhout, March 2000. This provides an alternative to the explicit specification of the du- 
ration and bounds on the number of keyframes. A Segment Theme descriptors) may describe the preferred theme, 
or point of view, of a segment, e.g., a video or audio clip, annotated with its theme or emphasis point. For example, 
the theme may specify characteristics of the content of the theme. Such characterization may include a goal from your 
favorite team, 3-point shots from your favorite player, etc. Specifying these descriptor(s) and also ranking them enables 

25 a client application or a server to provide to the user segments according to preferred themes (and/or their ranking) 
matching to the their labels or descriptors at the segment level, or provide users with pre-assembled highlights com- 
posed of segments with labels matching the SegmentTheme preference. 

[0177] Existing filtering and search user preference descriptions are directed to techniques of using the audiovisual 
content in an effective manner by finding, selecting and consuming the desired audiovisual material, while focusing on 

30 the content of the audiovisual materials. While such descriptions are beneficial, the present inventors came to the 
further realization that the identification of the source of the material, in contrast to merely its content, provides beneficial 
information for the processing and presentation of the audiovisual materials. For example, the source of the content 
may be from terrestrial sources, digital video disc, cable television, analog broadcast television, digital broadcast tel- 
evision, analog radio broadcasts, and digital radio broadcasts. The inclusion of this information permits the user to 

35 select among these different sources and increase effectiveness by narrowing down the choices to those sources that 
are available to the user, such as terrestrial broadcast which is more widely available than satellite broadcast. For 
example, user may describe user's preference for "Star Trek" episodes that are available from terrestrial broadcast 
channels only. 

[0178] This source distinction and identification may be performed by including a source preferences description 
40 scheme under the filtering and search preferences description scheme (or other description scheme). Accordingly, the 
search and preferences description scheme may include from zero or one (or more if desired) source preferences 
description scheme. The source preferences description scheme may be derived from the Media Format description 
scheme or Publication Description Scheme specified in ISO/lEC JTC1/SC29/WG11 N3247, MPEG-7 Multimedia De- 
scription Schemes, Experimentation Model (v2.0) Noordwijkerhout, March 2000. 
45 [01 79] Another feature that may be included in the system, in addition to the users preferences, is the user's negative 
preferences. The negative preferences may include the user's dislikes and their relative rankings. By specifying the 
negative preferences, the system is less likely to select such matching preferences. This may be implemented, for 
example, by permitting positive and negative values to the preferencevalue descriptor. 

[0180] Another feature that may be included in the system is the specification of the user's preferences as a relative 
so preference measure of a particular set of user preferences with respect to another set of preferences, such as for 
example, by using BetterThan and WorseThan descriptors. This permits an implicit relative ranking of preferences 
even in the absence of a preference value descriptor for each preference set. This may be implemented, for example, 
by including Betterthan and WorseThan descriptors in the filtering and search preferences descriptions. 

55 Expression of the Additional Attributes 

[0181] The following descriptions are expressed in XML (Extensible Markup Language), incorporated by reference 
herein. It is to be understood that any other description language may likewise be used. 
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[0182] The definition of the user preference description may be as follows. 
<UserPreference> 

5 

<UserIdentifier protection-'true" userName= !, paur , /> 
<UsagePreferences aIlowAutomaticUpdate="false"> 
1 o <Bro wsingPreferences> 



15 



25 



30 



35 



</BrowsingPreferences> 
<FilteringAndSearchPreferences> 



20 </FilteringAndSearchPreferences> 

<DevicePreferences> 



</DevicePreferences> 
<AJsageHistoiy> 

</UsageHistory> 
</UserPreference> 



[0183] The primitive attributes "protection" and "allowAutomaticUpdate" may be instantiated in the Userldentifier, 
Usage Preferences, and Usage History descriptions and ail its relevant parts, namely, in Browsing Preferences de- 
scription, Filtering and Search Preferences description, Device Preferences description, and sub-description schemes 
40 of the Usage History description Scheme. 

[0184] The "allowAutomaticUpdate" attribute (set by the user) should be included in a description scheme specifying 
whether or not the preferences can be automatically modified (e.g., by an agent utilizing the usage history description) 
without consulting with the user. 

[0185] The protection attribute should be included in a description specifying whether the user allows the system to 
45 make preference/history public or not. When the user agrees to make some parts of his preference/history public, for 
example, to service providers, the service providers can collect this information and then serve to the user contents 
that are tailored to the user's history/preferences. In the above example description, the user prefers to keep his user- 
name private. He also does not wish the system to automatically update his preferences. 

[0186] The user identification description serves the purpose of an identifier that distinguishes a particular instanti- 
50 ation of the user description scheme from other instantiations for other users or other instantiations for the same user 
for different usage conditions and situations. 

[0187] The username descriptor may identify a specific user from other users. In a home setting, each member of 
the household may be identified using a username that is unique in the household for all devices that the members of 
that household use on a regular basis. A username can also be used to distinguish the user description scheme of not 
55 only an individual but also a group of people, e.g., the family. Those devices that are used on a temporary basis, 
potentially by many different people, (such as those in hotel rooms or rental cars) may assign temporary session 
identifications to ensure uniqueness of identifications. 

[0188] Alternatively, a version descriptor may also be included in the user identifier description to define different 
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versions of the user descriptions (preferences and usage history) associated with a particular username. Through the 
mechanism of the version, a person can specify different preferences and usage history, corresponding to different 
locations (at home, at the office, away from home, stationary versus traveling in a vehicle), different situations, different 
emotional states (happy versus sad), different seasons, etc. Different user descriptions are distinguished by distinct 
5 version descriptors. The type of the version descriptor, may be for example, an integer, a string, or expressed as an 
attribute of the user identification description scheme. 

[0189] The usage preference description may include a PreferenceType description, distinguishing a particular set 
of preferences or history according to time, or place, or a place and time combination. The definition of the usage 
preference description may be as shown in the following example, where place is "office" and time period is "8 hours 
10 starting from 8 AM" 



15 



20 



25 



30 



35 



<PreferenceTyp e> 
<Place> 

<PlaceName xml:lang= H en n >Ofrice</PlaceName> 

</Place> 

<Time> 

<TimePoint> 

<h>8<Vh> 



</TimePoint> 
<Duxation> 
<No__h>8</No_h> 
</Duration> 



40 

</Tiine> 
</PreferenceType> 

45 



[0190] The preferencetype descriptor may be used to identify the preference type of one or more set of preferences. 
50 As previously described, a user may have different preferences depending on the user's situation, location, time, sea- 
son, and so on. 

[0191] The browsing preferences description may describe preferences of the user for browsing multimedia infor- 
mation. In essence, this description expresses the user's preferences for consuming (viewing, listening) a multimedia 
information. This browsing preferences description may include for example, a Summary Preferences description. The 
55 browsing preferences description may include in the case of video, for example, the user's preferences for continuous 
playback of the entire program versus visualizing a short summary of the program. Various summary types are specified 
in the Summary Description Scheme in ISO/I EC JTC1/SC29 WG11 N3246, "MPEG-7 Multimedia Description Schemes, 
Working Draft v2.0", Noordwijkerhout, March 2000, including a keyframe summary, a highlight summary, etc., where 
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parameters of the various summary types may also be specified by summary descriptions, e.g., the time duration of 
the video highlight summary. 

[0192] The browsing preferences description scheme may include one or more of the following non-exhaustive list 
of descriptors and descriptions in its description scheme. 

5 

(A) The minimum number of keyframes (MinNumOfKeyframes) and the maximum number of keyframes (MaxNu- 
mOfKeyframes) descriptors may be included. These descriptors specify the user's preference for minimum and 
maximum number of frames in a keyframe summary of an audiovisual program. A user can specify these descrip- 
tors according to personal taste, situation, etc., and according to channel bandwidth and terminal resource limita- 

10 tion. 

(B) The minimum duration (MinSummaryDuration) and the maximum duration (MaxSummaryDuration) descriptors 
may be included. These descriptors specify the user's preference for the length of a highlight summary composed 
of key clips in the video. These descriptors may also, for example, be applied to an audio-only material. A user 
can specify these descriptors according to personal taste, situation, etc., and according to channel bandwidth and 

'5 terminal resource limitations. 

An example for Summary Preferences description that can be included in usage preferences description is 
provided below. 



20 </UsagePreferences> 

</BrowsingPreferences> 
<SummaryPreferences> 

25 

<SummaryTypePreference>keyVideoClips<^SummaiyTypePreference> 



30 



35 



40 



<MmSummaryDuration><m>3<^ 
<MaxSummaryDuration><m>6<^ni>^ 
</SummaryPreferences> 
</BrowsingPrefcrcnces> 
</U sagcPrcferences> 



(C) The abstraction fidelity descriptor for universal multimedia access application relates to fidelity of a summary 
abstraction of a program. This preference descriptor may correspond to the variation fidelity descriptor contained 

45 in the media's variation description specified by Variation Description Scheme in ISO/DEC JTC1/SC29 WG11 

N3246 : "MPEG-7 Multimedia Description Schemes, Working Draft v2.0", Noordwijkerhout, March 2000. Alterna- 
tively, the duration and number of keyframes may be defined as the fidelity descriptor. 

(D) The SegmentTheme descriptor(s) may be included, which describes the theme or point of view of a segment, 
e.g., a video or audio clip annotated with its theme or emphasis point. An example summary preference description 

so expressing preference for video segments (clips) labeled as "Goal from Spain" and "Replay of Goal from Spain" 

is as follows: 



55 
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</UsagePreferences> 
5 </BrowsingPreferences> 

<SummaryPreferences> 

10 

<SummaryTypePreference>KeyVideoClips</SumniaryTypePreference> 
<SegmentTheme>Goal from Spain</SegmentTheme> 

15 

<SegmentTheme>Replay of goal from SpaiiK/SegmentTheme> 
</SummaryPreferences> 
20 </BrowsingPreferences> 
</UsagePreferences> 

25 

(E) The frame frequency value descriptor may be included to specify the temporal sampling frequency of video 
frames that can be visualized in the browser. The frames provide a visual summary. Depending on the browser, 
they may also provide clickable entry points to the video. The user may click and start playing back the video 
starting from that frame. The frame frequency value descriptor provides similar functionality in terms of shots of 
30 the video. 

[0193] The source preference description describes the preferred source of multimedia information, such as the 
broadcast or storage medium type (e.g., terrestrial, satellite, DVD), broadcast channel identifier, etc. An example user 
preference description expressing preference for Star Trek episodes available from terrestrial broadcast is as follows. 

35 

<UserIdentifier protection="true" userName="paur/> 
<UsagePreferences aUowAutomaticUpdate="false n > 

40 

<FilteringAndSearchPrefereiices protection= ,, tnie ,, > 
<PreferenceValuc>5</PreferenceValue> 
45 <CreationPreferences> 

<Title xml:lang="en" type= n originar>Star Trek</Title> 

so 

55 
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</CreationPreferences> 

<SourccPrcferences> 

<PublicationType>TerrestriaI Broadcast</PublicationType> 

</SourcePreferences> 
</FilteringAndSearchPreferences> 
</UsagePreferences> 
</UserIdentifier> 

[0194] The filtering and search preferences description includes at least one of the descriptors of preferred program 
title, genre, language, actor, creator of the program. An example description where user's preference is for news pro- 
grams in English is given below. Such description may be included in user's smart card when he travels to Japan, for 
example. Note that this particular preference description is identified as being specific to Japan and differentiated by 
choosing an appropriate user name. 

<UserIdcntifier protection="true" userName^"paul_in_Japan ,, /> 
<UsagePreferences allow AutomaticUpdate="false w > 
<FilteringAndSearcbPrefcrences protection- 'true'^ 
<Preference ValuOl OCX/Preference Value> 

<ClassificationPreferences> 

<Language> 

<LanguagcCode>en</LanguageCode> 

</Language> 

<Genre>News</Genrc> 

</ClassificationPreferences> 
</FilteringAndSearchPreferences> 
</UsagePreferences> 

</UserIdentirier> 

[0195] The Preference Value descriptor provides a technique for prioritizing filtering and search preferences, such 
as the value indicating the degree of user's preference or non-preference. Non-preferences may be expressed by 
assigning a negative (opposite) value to the preference value descriptor. 

[0196] The betterthan and worsethan descriptors may describe which instantiation of preferences the user likes or 
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dislikes relatively more compared to another instantiation, where different instantiations are identified using the filtering 
and search preference type descriptor. This provides robustness against changes in the preference value descriptor 
automatically, for example, by an agent. 

[0197] The filtering and search preferences description may also contain a description of a preferred review to ex- 
5 press user's desire for searching for programs that are favorably reviewed by specific individuals. For example, pref- 
erence for movies reviewed by movie critics Siskel and Ebert and found to be "two-thumbs- up" may be described and 
included in the filtering and search preferences description. An overview of the entire description scheme is shown in 
FIG. 29. With the ever increasing amount of available media, such as audio, image, and videos, it becomes increasingly 
more difficult for a user to select desirable media for subsequent consumption . The user may manually peruse program 
10 listings to select the desired material. However, the manual selection of media from an exhaustive program listing is 
time consuming and inefficient. 

[0198] As previously discussed, a description scheme, such as those for the user, programs, and system, provides 
a structure within which is captured information regarding (1) the user's preferences, (2) the system, and (3) the pro- 
grams themselves. By processing the information contained within the user's usage description scheme and the pro- 
fs gram description scheme of available programs, the system may determine those programs that are most likely desir- 
able to the particular user. The processing by the system for such information may be referred to as an agent. 
[0199] Existing agents are focused on correlating a limited number of user preference descriptors with a limited 
number of program descriptors. The designer of such agents manually determines, and hard codes into the agent, 
predetermined interrelationships which are likely to result in identifying desired programs. As such, the mapping be- 
20 tween the user preference descriptors and the program descriptors includes a static model because such designers 
are under the belief that the domain of data fields is a fixed predetermined set, and therefore the relationships between 
the potential combinations of relevant data is likewise a fixed predetermined set. For example, the "actor" in the user 
preference and the "actor" in the program descriptor may be a relevant potential combination. The traditional focus for 
designing such static agents avoids the problematical dilemma of how to interpret and process an arbitrarily complex 
25 set of preferences. 

[0200] Maintaining the traditional focus of avoiding an arbitrarily complex set of user preferences, commercial prod- 
ucts such as TiVO and Replay TV, permit the specification of a first preference, such as a particular actor. The user 
may further attempt a more specific search by searching for a first preference, a second preference, and additional 
preferences. While this results in identifying the desired programs, it is a time consuming and frustrating process for 

30 the user. Like the static agents, the TiVO and Replay TV devices have a limited set of permitted search queries. 

[0201] While such static models of the interrelationships is readily easy to implement, it results in a system that is 
unable to process interrelationships that are not foreseen by the agent designer. The present inventors came to the 
realization that all of the potentially desirable interrelationships, especially for an arbitrarily complex set of preference 
criteria, can not be effectively programmed using the traditional static model. 

35 [0202] Referring to FIG. 30, a filter agent 600 receives or otherwise has access to at least one program description 
602 and at least one user preference description 604. Each program description 602 may also, if desired, include meta- 
data associated with the actual consumable program media. Also, the user preference description 604 contains se- 
lection criteria for the information contained within the meta-data. The output of the filter agent 600 is a subset of the 
received program descriptions 606 that have been selected, and tailored if needed, in accordance with the user pref- 

40 erence description 604. 

[0203] Referring to FIG. 31, the filter agent 600 receives the user preference description 604 and interprets the 
information contained within the user preference description 604 at block 610 using information from a mapping table 
614. The filter agent 600 also builds a model , such as a combinatorial model, of the user criteria indicating the desired 
user criteria at block 61 2 using information from the mapping table 614. The resulting model or otherwise set of criteria, 

is is then applied against the available program descriptions 602 at block 616 to select the desired programs. Each of 
the selected programs include a rich set of associated data which may then be applied against user criteria at block 
618 to further refine the data by extracting desirable sub-portions of each program. Each of the selected programs 
may further be cloned at block 620 together with the desired sub-portion of each program, and the resulting tailored 
instances are output from the filter agent 600. 

so [0204] Referring to FIG. 32, a Program Description may be composed of a hierarchy of individual descriptions. The 
hierarchy implies relationships amongst the individual description elements including composition, type-of, and other 
relationships. The particular structure of the input Program Descriptions vary, and are typically generated and provided 
by a commercial provider. The output Program Descriptions may be, for example, copies of the selected input instances, 
portions of the selected input instances, or are modified clones of the input instances. In the case of modified clones, 

55 the clones should describe a subset of the program media that meets the user's preferences, and exclude the portion 
that the user does not desire, or is not feasible to process for other reasons, such as bandwidth. It is to be understood 
that the output may omit cloning, if desired. 

[0205] Referring FIG. 33, the User Preference Description may include primitive elements that describe individual 
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or multiple preferences. The individual preferences may be generally defined in terms of "name : value" pairs. The 
name component is one of a set of Preference Names. The domain of the value depends on the name, such as codes 
and free form text 

[0206] The individual preference may be a single preference test (or multiple preference). It is to be understood that 

5 the individual preferences are not limited to tests. For example, the User Preferences may describe the desired con- 
figuration for presentation, such as volume or any other functionality. Each preference test describes some aspect or 
attribute of the Program Description that is to be evaluated. If desired, the preference test may be limited to the smallest 
granularity of a test that may be executed on the Program Description. A common preference test is a comparison of 
a Program Description element's value to the preference value. It is also to be understood that the preference tests 

10 need not be directly associated with the value of a corresponding element, if any, of the Program Description. For 
example, a single User Preference element, indicating a desired number of key-frames to be shown, may be tested 
against the count of elements in the Program Description representing (or describing) a key-frame. In general, the pair 
(title: title_value) will compare the title element value of the Program Description to title_value. 
[0207] After further consideration, the present inventors came to the realization that the individual preferences may 

15 include composite relationships. Moreover, the relationships may include duplicate fields, such as several instances 
of "name" in either the user preferences and/or the Program Descriptions. With the inclusion of composite relationships 
it becomes difficult to determine an appropriate technique for queries, where duplicate individual preferences are at 
the same or different hierarchy levels. In addition, it is difficult to determine how to interpret queries that provide multiple 
matching results (such as several instances of "John Doe") or inconsistent matching entries (such as several instances 

20 of "John Doe" and a lack of an instance of "Comedy"). For example, referring to FIG. 34, if the user uses a query 
involving multiple preference names, and the query results in several potential matches, it is difficult to determine if an 
appropriate program has been located or which portion of an appropriate program is suitable. As shown in FIG. 34, 
the preference with name A is composed of one instance of name B and two instances of name C, each of which may 
include the same or different names. 

25 [0208] Referring to FIG. 35 an example of a portion of a User Preference Description is illustrated. This portion of a 
user preference description illustrates a hierarchy of a "creator" that has the "role" of "author" with the name of "Thomas" 
"Aquinas". In addition, the hierarchy may be used as the path to define the interrelationships. 

[0209] The normal behavior of a location path is to retrieve the single data from this node in the program. One 
potential enhancement is to allow this data to be manipulated and combined with other related nodes to form a com- 
30 posite value. 

[0210] One example is when evaluating a media review rating, three numerical values may be provided, namely, 
RatingValue, WorstRating, and BestRating. A composite value for media review rating may be calculated as ((Rating- 
Value) - (WorstRating)) / ((BestRating) - (WorstRating)). 

[0211] Another example may include the test of Keyword preferences against the Title or Description fields by con- 
35 catenating these two fields. A composite value might be calculated as (Creation Description/Text An notation) & (Title/ 
TitleText). It is noted that these two fields use relative paths from the parent "Creation" element. 
[0212] Yet another example may include a single preference data manipulated to adjust its value numerically, or its 
text may be translated into a target language. 

[021 3] The composite values provide defaults for any of the calculated elements. This is useful for defining the default 
40 range of a media review rating. It is also useful for inserting identity values (e.g. 1, 0, "") when the absence of an 
element should not. make the test fail. 

[0214] The Preference Description may make user of built-in composite values. An example of built-in composite 
value may be based on the environment of the viewer. For instance, a portion of a Preference Description may define 
time of day ranges when the user wants the associated preferences to be evaluated. The target location could be 

45 defined as a composite value of built-in type "TimeOfDay". 

[0215] Referring to FIG. 36, the present inventors have determined that a mapping table of the User Preferences 
and the input Program Descriptions provides a robust comparison of the user preferences and the input Program 
Descriptions. It is to be understood that the mapping table may be any type of data structure, as desired. The first 
column ("name") of the mapping table includes the name of one or more of the user preferences. FIG. 36 illustrates 

so the inclusion of the user preferences of FIG. 35. Each node, generally referred to by name, of an individual preference 
to be tested has an ancestry path associated with it. The second column ("location") of the mapping table includes the 
name of one or more of the input Program Descriptions. Alternatively, portions of the path or even single elements of 
the path may be specified in the table, if desired. For example,the Creator/Jndividual/FamilyName preference in FIG. 
35 has a path of /FilteringAndSearch Preferences/Creation Preferences/ Creator/lndividual/FamilyName. This path is 

55 decomposed and resolved, piece by piece, using the "Location" column (e.g., field) in the Mapping Table. The result 
of decomposing and resolving the user preference path results in a corresponding path within the Program Description 
resolved from entries in column two. For example, the resulting location path for this test may be 
VProgram/CreationMetalnformation/Creation/Creator/ 
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Individual/FamilyName". 

[0216] Common names, such as "Country" used at multiple locations, may be distinguished by including all or part 
of the ancestry path. For example, the following two preference tests have the same "leaf" name, but it may be desirable 
to have different tests for each. This may be done by specifying more of the ancestry in the Name field (column 1) of 

5 the mapping table: 7FilteringAndSearch Preferences/Creation Preferences/ Creation Location/Country", and 

"/FilteringAndSearch Preferences/Classification Preferences/Country". To distinguish between the two, the following 
names may be used: "/Creation Location/Country" and 7ClassificationPreferences/Country'\ In addition the preference 
tests may be associated with multiple entries in the Mapping Table. This permits a single test to be performed on more 
than one location in the Program Description. 

10 [021 7] The Location field may include various wildcards to expand or restrict the target paths to be evaluated in the 
Program Description. For example, a "*" wildcard implies that there may be multiple instances of the given location 
under one parent, e.g., /Creation/*Creator implies that there may be multiple Creators under the Creation parent. A 
"#xxx" wildcard restricts the target path to the xxx instance of the given location under its parent, e.g., /Creation/ 
#002Creator restricts the target path to the second instance of Creator under Creation. A double forward slash 7/" 

is indicates a node of the target path which may be used as a base path for groups of tests which must be constrained 
to evaluate from the same common location. In particular, this is useful for Constrained-AND operations, described 
later. The preference paths may be used to build target locations in the program. These preference paths may also 
allow preference paths to be interpreted as locations. Composite values may be defined for these preference path 
locations. 

20 [021 8] Syntax for a default preference and a default location may be provided. This allows updates in the preference 

or program definition to be handled by the filter agent without requiring changes to the mapping table. 

[021 9] The default mapping elements may be specified for a limited set of preference branches to bound the default 

mapping to a safe portion of the user preferences. 

[0220] For instance, the default element 
25 "FilteringAndSearch Preferences/Creation Preferences/ 

UserDefined Preference/.*" may place a default mapping that can only map to elements in the program beneath the 

" P ro g ram/C reati on M eta Inform at ion/Creation" branch. 

[0221] The third column "TestOp" of the Mapping Table includes what comparison to perform between the corre- 
sponding user preference path (column 1 ) and (resolved) input Program Description location (column 2). In this manner, 

30 the Mapping Table provides a convenient manner of identifying the interrelationships between the corresponding data 
from the user preferences and input Program Descriptions. For instance, the "Family Name" preference in FIG. 35 has 
a test operator of substring-case-insensitive when compared with 7* Family Name". Test operators may yield a discrete 
result, such as true or false, they may yield a range of values, or any other desired data. In particular results that span 
a range of values provide the advantage that filtered programs may be sorted according to the resultant "similarity" 

35 value. This provides the user with a ranked output that they may select from. Also, user preferences maybe "softened" 
to pass programs that are near matches to the specific preference criteria. This fuzzy approach may allow the user 
preference description to more clearly model the user's intended criteria. In cases where the entry is always a parent 
(composed of children preference tests) the test operator may be NA (not applicable). An exemplary set of test operators 
are illustrated in FIG. 37. 

40 [0222] After the individual preferences are interpreted into individual preference tests, these tests may be combined 
into a single test that models the user's preferences. The preferred technique includes combining the individual pref- 
erence tests according to their hierarchy. Each parent test becomes the combination of its children tests, and this 
continues up to the root preference, yielding in effect one composite test. The combination of "children" tests within a 
single "parent" may be broken down into the combination of similar tests and the combination of dissimilar tests. Similar 

45 tests may have the same name or otherwise be associated in the Mapping Table such as by being on the same row. 
Also, dissimilar tests may have different entries in the Mapping Table. 

[0223] It is to be understood that the concept of inter group and intra group interrelations relates to any comparison 
between different sets of data, whether or not they include a hierarchical scheme. As an example, intragroup may be 
used to define a group of similar tests. Also, any scheme may be implemented to form comparisons or groupings for 
50 the testing of data. 

[0224] If desired, the mapping table, which may be any type of data structure or otherwise to simply express the 
desired operations to be performed, may be expanded to include additional functionality. For example, specific group- 
ings of user preference may be denoted, to specify additional operations to be performed on the elements of the group 
that are separate from the inter group and intra group operations. These specific groupings may provide additional 
55 flexibility for combining individual preference tests. The combinatorial operations applied to these groups may be per- 
formed before, after or instead of the general inter group and intra group combinatorial operations. 
[0225] For instance, entries in the mapping table may be explicitly linked together with a shared index, and a specific 
combinatorial operator may be mapped to each indexed group. The UserPreferences elements may likewise be ex- 
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plicitly linked together with a shared index. The latter two groups and operators present an alternative method to gen- 
erate the arbitrarily complex combinations, without using one of the tour methods for generating all typical permutations, 
shown in FIG. 38. A preferred sequence for performing the various combinatorial operations might be intra group 
operation, followed by indexed group operation, followed by inter group operation. 

s [0226] In addition to explicitly defined indexed groups, other groupings may be built-in. For instance, a program 
description may have attributes associated with it. The user preferences that are mapped to this program description 
and its associated attributes may be grouped together in a so-called attribute group, and a specific combinatorial 
operator may be mapped to this attribute group. For example, the program description element, TitleText, may have 
a language attribute associated with it. A user preference, Keyword Preferences, may be mapped to TitleText and a 

10 separate user preference may be mapped to the language attribute of TitleText. These two user preferences may be 
grouped together into the following attribute group, and the results to these two tests may be combined in an attribute 
group combinatorial operation: 



15 



Name 


Location 


AttrGroupOperation 


KeywordPreferences @ xml.lang 
K ey wo rd P ref e recn es 


Title/TitleText@xml.lang 
Title/TitleText 


AND 
AND 



The functionality may also include multi-mapped preference group and associated operator. Elements in this group 
20 may have the same user preference element, but have multiple different program description mappings. For example, 
PersonName may have the following mappings, forming one multi-mapped group: 



25 



Name 


Location 


M u It i M ap G rou pO p e rati o n 


PersonName 
PersonName 
PersonName 


Creator/GivenName 

Creator/FamilyName 

Creator/ProfessionalName 


OR 
OR 
OR 



Preferably, the various groupings are combined in sequence starting with attribute groups, followed by intra groups, 

30 multi-mapped groups, indexed groups, and inter groups. 

[0227] Referring to FIG. 38, exemplary examples of combinatorial operators are listed. Several of the combinatorial 
operators (notably SAND, SUM, FREQ, and RATIO) provide "soft" combinations that can be used to yield non-zero 
results, even when some of the individual preference tests in the combination are zero. These soft combinations are 
useful when a group of program descriptions are to be evaluated, but one or more nodes in the group does not exist. 

35 In this manner, the result will be a diminished, but non-zero value. 

[0228] For example, the SAND operator provides a soft AND combination of its constituent elements by applying a 
transformation to the input values before they are combined. This may transform a zero input to a non-zero value. 
Additionally, the combination operation may be a non-linear function that will increase or decrease the result, related 
to a strict AND combination. 

40 [0229] Another set of combinatorial operators are soft maximum and soft minimum operators. In the typical maximum 
or minimum operation, only one of the combined individual preference tests determines the combined result value. In 
contrast, the soft minimum operator and soft maximum operator allows other non-contributing individual preference 
test results to adjust the final combined result. Typically, the adjustment is a minor amount, e.g.. +-10 percent. The 
purpose of the soft maximum/minimum operators is shown in the example where a user prefers program which contain 

45 A or B. IF a program with A and a program with A and B were available, the typical maximum operator would rank both 
programs equally whereas the soft maximum operator would rank the program containing A and B above the program 
containing only A. A similar result occurs from the soft minimum. 

[0230] Another combinatorial operator is an average, which averages a set of scores resulting from a plurality of tests. 

[0231] One combination for dissimilar preference tests is under a single parent. Each entry in the Mapping table has 
50 a field that defines how this type of preference test should be combined with different type preference tests under the 

same parent. This type of test may be referred to as inter group combinatorial operator (InterOperator). 

[0232] Referring to FIG. 39, this example illustrates a parent test with four dissimilar "leaf" test children. Two of the 

leaf tests have InterOperator AND. These two tests are evaluated independently, then their results are AND'd together. 

The other two leaf tests have InterOperator OR. These two tests are evaluated independently, then their results and 
55 the AND'd sub-result are all OR'd together to form the parent test result. 

[0233] The rules for combining dissimilar tests (with the operator mappings of OR and AND) may be: 

(1 ) evaluate all the tests; 
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(2) AND the test results which have lnterOp=AND, forming the InterAND result; 

(3) OR the test results which have lnterOp=OR, with the InterAND result, forming the InterOR result; and 

(4) the InterOR result is the final result for the parent test. 

5 [0234] In many cases, preference tests of the same type under a single parent will have a specific desired combination 
for those preferences before they are combined with the other different children of that parent. Each entry in the Mapping 
Table has a field that defines how this type of preference test should be combined with similar type preference tests 
under the same parent. This may be referred to as the intra group combinatorial operator (IntraOperator). Referring to 
FIG. 40, the parent test has four dissimilar children and four pairs of similar children. The similar children are first 

10 combined into intra groups using either respective IntraOperator. Then the intra group results are combined with the 
other groups and tests using their respective InterOperator. 

[0235] The rules for combining similar and dissimilar tests (with the operator mappings of OR and AND) may be, for 
example: 

'5 (1 ) evaluate all the tests; 

(2) group together test results that have the same name, forming Intragroups; 

(3) AND the test results within Intragroups which have lntraOp=AND, forming IntraAND results; 

(4) OR the test results within Intragroups which have IntraOp^OR, forming IntraOR results; 

(5) AND all the solitary test results (not part of an Intragroup) and Intragroup results which have lnterOp=AND, 
20 forming the InterAND result; 

(6) OR all the solitary test results and Intragroup results which have lnterOp=OR, with the InterAND result, forming 
the InterOR result; and 

(7) the InterOR result is the final result for the parent test. 

25 [0236] The general case of intra group combinations shown in FIG. 40 has the special default case shown in FIG. 
41 . This simplified approach supports the most common intra group operation, OR'ing and the most common inter 
group operation, AND'ing. One of the advantages of the approach of FIG. 40 is that the most common operations are 
supported with reduced constructs, and other combinations are supported by duplicating the hierarchy as necessary. 
The default mapping allows field updates to the User Preferences Description schema without requiring a change to 

30 the application or Mapping Table. 

[0237] An example of a default mapping may include defining a parent (e.g., node) in the user preference that maps 
to a parent (e.g., node) in the Program Description and setting a "default" comparison between the two. In the event 
that an additional child user preference is added to the parent in the hierarchal tree then this child is automatically 
mapped to a corresponding child of the parent in the hierarchal tree of the Program Description. Preferably the two 

35 children have the same name to simplify identification. 

[0238] The example illustrated in FIG. 42 show the four permutations for combining two leaf tests that may be of 
type A or B, namely, AND dissimilar types, OR similar types, OR dissimilar types, and AND similar types.. In addition, 
these leaf tests could also be parent tests themselves, composed of their own sub-hierarchy. The simplified approach 
relies on there being a layer (below the topmost layer) that has IntraOperator AND, while the rest of the hierarchy 

*o further down uses IntraOperator OR. This supports combining similar or dissimilar tests by merely going up in the 
hierarchy a sufficient number of levels. 

[0239] The location mappings described in the Mapping Table yield global paths that start from the root node in the 
Program Description ("/Program"). Some preference tests may require support for a relative path. A special form of 
the InterOperator AND is defined which constrains a group of tests to be performed on the same element or parent 

45 element in the Program Description. This is defined as a Constrained-AND (CAND) combinatorial operator. 

[0240] The constrained operation has a base path and multiple tests. The base path defines the starting node for all 
the predicate tests to be performed from. In the general example illustrated in FIG. 43, the parent of the three Con- 
strained-AND tests is "P". The mapped location for"P" in the Program Description is the base path, which resolves to 
"r/s/t/p". Therefore, for each instance of "r/s/t/p" in the Program Description, the children elements "x" and "y/z" are 

50 tested by "X", and the two "Y" tests. 

[0241] A user trying to find programs on stuffed pasta might create the following profile fragment: 



55 



54 



BNSDOCID: <EP 1 189437 A2_l_> 



EP1 189 437 A2 



<KeywordPreferences xml:lang=en> calzone </KeywordPreferences> 

5 <KeywordPreferences xml:lang=en> tortelini </KeywordPreferences> 

<KeywordPreferences xml:lang=en> ravioli </KeywordPreferences> 

The word calzone is a type of stuffed pasta in English, but it is underwear in Spanish. Without the use of Con- 
10 strained-AND, the agent may erroneously retrieve programs such as 



<Creation> 

<Crcation Description xml:lang=en> Victoria Secrets models women's undcrwear</CrcationDescrition> 
<Creation Description xml:lang=es>El Secreto de Victoria se demuestra los calzoncs dc mujer</CreationDescrition> 
</Crcation> 

20 [0242] The example shown in FIG. 44 depicts the Constrained-AND operator used for the Creator preference. A 
special syntax in the Mapping Table indicates that the base path is "/Program/Creation Metal nformation/Creation//Cre- 
ator". Therefore, the predicate tests are performed against the node instances of "Creator". The first Program Descrip- 
tion examples passes this Constrained-AND test, while the second fails. Notice that the second Program Description 
would have passed a test that was composed of regular AND operators and their global paths. 

25 [0243] As shown in FIG. 44, if all the CAND's were all regular AND's, then the User Preference would be asking: 

(1 ) Are there any nodes of path 

"/Program/Creation Metalnformat ion/Creation/Creator/ 
role" that have value matching "Author" AND; 
30 (2) are there any nodes of path 

"/Program/Creation Meta Information/Creation/Creator/ 
— Individual/FamilyName" that have value matching 
"martin" AND; 

(3) are there any nodes of path 
35 "/Program/Creation Meta Information/Creation/Creator/ 

Individual/FamilyName" that have value matching "Martin". 

This test would pass both Program Descriptions shown in the example. 

[0244] As shown in FIG. 44, with the defined CAND's, the User Preference is asking: 

40 

(1) are there any nodes of path 

"/Program/CreationMetalnformation/Creation/Creator", then, at each instance of these nodes; 

(2) are there any child nodes of path "role" that have value matching "Author" AND; 

(3) are there any child nodes of path "lndividual/FamilyName n that have value matching "Martin" AND; 
45 (4) are there any child nodes of path "individual/GivenName" that have value matching "Steve". 

This test would only pass the first Program Description. This illustrates that the user of AND and CAND operators on 
the same program description may result in different results. 

[0245] Referring to FIG. 45, a general example demonstrates Inter Operators, Intra Operators, and Constrained 
so Operators. The rules for combining the tests may be, for example: 

(1) group together tests which have lnterOp=CAND, forming the CAND group; 

(2) determine the base path for the CAND group from the lowest common Program Description path indicated in 
the Mapping Table; 

55 (3) for each path instance in the Program Description that is equivalent to this base path, evaluate all the tests 

within the CAND group, from this path instance; 

(4) evaluate all the tests not within the CAND, from the root path; 

(5) group together test results within Intragroups which have lntraOp=AND, forming IntraAND results; 
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(6) AND the test results within Intragroups which have lntraOp=AND, forming IntraAND results; 

(7) OR the test results within Intragroups which have lntraOp=OR, forming IntraOR results; 

(8) AND the solitary test results (not part of an Intragroup) and Intragroup results which have lnterOp=CAND, 
forming the InterCAND result; 

(9) AND the solitary test results and Intragroup results which have lnterOp=AND, with the InterCAND result, forming 
the InterAND result; 

(10) OR the solitary test results and Intragroup results which have lnterOp=OR, with the InterAND result, forming 
the InterOR result; and 

(1 1 ) The InterOR result is the result for the parent test. 



[0246] An illustrative example of one embodiment of the technique described herein includes the example illustrated 
in FIG. 46 together with the resulting Mapping Table illustrated in FIGS. 47A-47C. It is noted that the default for InterOp/ 
IntraOp operations are AND/OR. It is also noted that the preferences just below the highest level (Creation Preferences, 
ClassificationPreferences, Summary Preferences) are AND/AND. Also some of the composite preferences such as 
is Creator have child preferences that are CAND/OR. Further, the multiple Filtering And Search Preferences may be 
distinguished by Preference Type attributes. The IntraOp for multiple Filtering And Search Preferences and multiple 
Browsing Preferences is specified. 

[0247] The multiple User Preference elements may contain a ranking attribute. Such ranking attributes may be ap- 
plied at each comparison test and each combinatorial operation, to yield a composite ranking score, this may be utilized 

20 to provide a sorted test result for the user. 

[0248] Referring to FIG. 48, the user preference hierarchy of (name : value) pairs may be supplemented with at- 
tributes regarding the intended combination of individual preference tests. As shown in FIG. 48, the supplementation 
may indicate that the two tests of type A should be AND'd together and this result OR'd with the other tests under this 
parent. The primary advantage of this supplemental enhancement is that the user preference description may override 

25 the default behavior for specific tests. This makes interpretation of the user preference description more flexible so 
that it may be tailored to the user's specific preference criteria. 

[0249] The discrete implementation of the filter agent will yield as output a group of program descriptions that are 
merely members of the input set. The output group may actually just be a list of the input Program Descriptions that 
passed the selections. However, there can be components of the User Preference Descriptions that are well suited to 
30 extract a subset of the whole Program Description that yields an output more tailored to the user's preference criteria. 
For instance, the user may request a maximum number of key frames in order to prevent overloading the bandwidth 
capabilities of their system. 

[0250] The process of cloning the selected input Program Descriptions and modifying them to include a particular 
desired subset by the user may achieve enhanced benefits. The modified Program Description is a clone of the input 

35 because it refers to the same base set of Program Media. However, it is modified to refer to the subset of the Program 
Media that is desired by the particular user. In some cases this may result in smaller quantity of the program being 
available. In other cases, this may result in different summaries of the program, though it refers to the full program. 
[0251] The cloned Program Description provides a more succinct representation of what the user prefers. In this 
manner, it may not be necessary to annotate or provide additional identifiers to describe what the user actually desires. 

40 [0252] In a modular implementation, the filter agent may not be closely coupled with the media manager and the 
presentation processes. In this case, the cloned Program Description offers a standardized format for describing the 
desired program, without having to create a new syntax or an application programming interface (API). 
[0253] The cloned Program Description may also be used to create a "pull" for Program Media that will yield only 
the desired portions of the media. This provides a convenient technique for a media provider to provide to the user 

4 5 only that specific media that is desired. 

[0254] A service provider may likewise provide service to the user according to the user's preference where service 
includes a modified cloned program description. The cloned description may be a subset of the complete "rich 1 ' program 
description that is usually maintained by the service provider. The clone may contain varying levels of "richness". This 
permits the provider to offer various service levels to its clients. 

50 [0255] The cloned Program Description also allows the customer and/or service provider to tailor the amount of 
material that will be transmitted to the customer. This enables the quantity of material to be matched to the available 
memory in the client device and the available bandwidth of the delivery channel. 

[0256] The cloned program descriptions may provide a memory efficient way of storing descriptions of selected 
programs in the client's local storage. 
55 [0257] One technique to achieve cloning is cloning by "addition", as illustrated in FIG. 49. The core elements of a 
Program Description are identified and copied into the clone. These items typically include the root "/Program" element, 
the "/Media Information", etc. To this core set, the extractor adds the desired components for the user. Also, some 
adjustment may be necessary to resolve interdependences of the extracted elements. For example, the Program 
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Description may contain elements or groups of elements that are extensions or refinements of other elements in the 
Program Description. These extension elements may refer to the base elements without actually duplicating the base 
elements. In this instance, if the extractor should extract the extension elements but not the base elements then all the 
base elements must be inserted into the closed Program Description to make it accurate. 

5 [0258] Anothertechnique to achieve cloning is cloning by "deletion", as illustrated in FIG. 50. The entire input Program 
Description is cloned for the output. The extractor then builds a list of the desired components that are to be retained 
in the output. Then, optional elements are identified in the clone. If these optional elements are not included in the list 
of elements to be retained, then they are deleted. Thereafter, some adjustments may be necessary to resolve inter- 
dependencies of the extracted elements, as previously described. One advantage of cloning by deletion over cloning 

10 by addition is that it is less susceptible to changes in the Program Description. Items that are not explicitly listed as 
being optional will be passed on. In effect, this method errs on the side of passing too much data, while cloning by 
addition errs on the side of passing too little. 

[0259] A vast amount of audiovisual material exists from which the user may select appropriate audiovisual materials 
that may be of interest. However, there needs to be developed effective techniques to determine which audiovisual 

is materials are most likely appropriate for a particular user. Typically these techniques include the use of an agent that 
compares in some mannerthe user's preferences to the audiovisual content. Existing agents typically offer rudimentary 
preference weighting techniques based upon preference items such as title, keyword, author, and cast. The weighting 
scheme determines to what extent a particular set of user preferences matches the description of the different audio- 
visual materials, such as a binary yes/no determination. After determining the extent to which the user preferences 

20 matches a particular audiovisual material an overall score is calculated. After computing the overall score for each of 
the audiovisual materials they may be ranked in order from which the user may select desirable material. However, 
the use of such a technique makes it difficult to distinguish between programs that are strongly desired versus fringe 
programs that the user may be rarely interested in. An effective agent should include a technique for identifying priority 
interests and a mechanism for sorting the priority interests. In essence, the audiovisual content should be distinguished 

25 jn a meaningful manner. 

[0260] Referring to FIG. 51 . the input to a filter agent 600 may include program descriptions 602 relating to one or 
more audio, video, or audiovisual (collectively referred to as audiovisual without limitation) materials 604. The filter 
agent 600 also receives user preference descriptions 608, typically from a user agent 610, which may be hierarchical 
if desired. The filter agent 600 based upon the user preference descriptions 608 and the program descriptions 602 

30 provides selected audiovisual materials 606, which may be rated if desired. The selected audiovisual materials may 
be provided to a user agent 610 and then to the user, if desired. The user agent 610 may create the user preference 
profile. Alternatively, the user or other party may utilize an authoring tool or automatic profile generator to create the 
. user preference profile. 

[0261] Referring to FIG. 52, a collection of related preferences may form a single preference template 612 (e.g., 

35 comedies with Eddie Murphy). The user agent 61 0 may create a group of one or more preference templates 612 that 
are evaluated to present the user with filtered and ranked programs. Each component of a preference template that 
may carry a preference value (PV) attribute may be referred to as a preference element (or node) 614. A container (or 
parent) 616 preference element has children preference elements 61 8. A parent may also have an individual preference 
test. Leaf preference elements 620 are those that do not have children and may refer to an individual preference test 

40 (e.g., genre is comedy). 

[0262] Referring to FIG. 53, the values defined by the preference value may take one or more values, such as for 
example, nominal value 620, neutral value 622, maximum value 624, and minimum value 626. It is to be understood 
that the preference values may take on other values. The neutral value 622 is preferably zero or some other predefined 
value representative of a neutral preference or indifference to programs with the associated preference. Typically, 

45 preference values with a neutral value (or null set) are not included in the resulting classification or scoring of the 
associated audiovisual content. The nominal value 620 preferably has a value otherthan zero, or some other predefined 
value, representative of a desire or disdain for programs with the associated preference. In essence, the nominal value 
620 indicates which programs are desirable by the user and which programs are not desired by the user. In this manner, 
programs which are desirable are more likely provided to the user, while programs that contain disdained content will 

50 be more likely not provided to the user. The nominal value 620 may be a constant value or otherwise a range of values 
to indicate a relative desire or disdain for particular content. Preferably, the user agent 610 or filter agent 600 provides 
a default value for the nominal values 620, so that the user does not have to explicitly define each of the nominal 
values. The user may define any of the nominal values 620, as desired. Preferably, there is not a predefined maximum 
value 624 or minimum value 626. Conceptually, a maximum value may represent that the user agent wants programs 

55 with a particular preference to always be selected and ranked highest. Similarly a minimum value might represent that 
the user agent wants programs with a particular preference to always be rejected. In both of these cases the user 
agent may effectively simulate the maximum and minimum concept by selecting the value appropriately (e.g., +/- 
1,000,000). 
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[0263] The user preference description may include a hierarchy of preferences, many of which include preference 
value attributes. When each "individual preference" is evaluated against the corresponding information from the pro- 
gram descriptions a score is calculated for that individual preference. In one embodiment, the hierarchy of preferences 
for an individual hierarchy may be evaluated by creating a composite score from the aggregation of individual scores 
5 within the hierarchy. The resulting composite score is then compared against other composite scores for other program 
descriptions to determine a relative ranking. The composite score for a particular program description may be deter- 
mined free from consideration of other program descriptions, if desired. 

[0264] While a composite score provides a relatively good measure for the desirability for any particular media, 
especially when compared against other composite scores, the present inventor determined that the resulting relative 

10 composite scores may be misleading of the desirability of the content. For example, a particular audiovisual program 
may have only a limited occurrence of a particular event or item : in which case the composite score will likely consider 
that limited occurrence as being fully present in the program. However, a different audiovisual program may have 
frequent occurrences of a particular event or item, in which case the composite score will likely merely consider that 
frequent occurrence in the same manner as the limited occurrence. Accordingly, it may be preferable to rank programs 

15 at an intermediate level, described later. Also, it is not preferable to combine the preference values into a single com- 
posite preference value. Instead, each score, which is evaluated using its associated preference value, is combined 
into a composite score. When examining a user preference, it may be useful to combine one or more of the preference 
values, but this is actually combining the resultant scores when the preference is found to match a corresponding 
program description attribute. Also, it is likewise preferable not to compare a score against a preference value. Rather, 

20 the score is the result of the actual test considered with the preference value, and this score should be compared 
against other scores or against implementation-fixed thresholds. 

[0265] There are preferably at least two distinct processes occurring when the filter agent 600 processes a user 
preference 608. One process is the filtering of programs (pass or reject). The other process is the scoring and ranking 
of the selected programs into an ordered list (e.g., from most preferred to least preferred). The ranking values may be 

25 any type of indication, as desired. These two processes may be implemented by a variety of functions (e.g. filtering- 
function G {Boolean-AND, Boolean-OR, etc.}, ranking-function G {MIN, MAX, SUM, AVG, etc.}). These two processes 
may be distinct, such as, filter then rank, or rank then filter, or they may be integrated, namely, simultaneously filter 
and rank. In the hierarchical combination of user preferences, each combinatorial operator (AND, OR, etc) preferably 
implements some form of a filtering function and a ranking function. There may be a family of varieties of the AND and 

30 OR operators (or other operators) which implement different filtering and ranking functions. 

[0266] Programs may be ranked according to their respective scores, which is a relative relationship. The preference 
values and scores may be scaled without changing the result of the filter agent 600. The definition of a zero neutral 
value (or other value) sets a point for the filtering of programs based on score results. Depending on the filtering 
function, programs with a score above zero (or other value) may be passed and programs with zero or a negative score 

35 (or other value) may be rejected. The def in ition of the nominal value does not set an absolute significance for preference 
values and scores. The nominal value merely sets the default value that may then be compared (relatively) to other 
preference values and scores. 

[0267] The preference values may likewise be more than simply a number to provide further refinement of the se- 
lection process. The preference values may take the form of relativistic operations that compare different portions of 
40 the user preferences and multiple program descriptions at a level less than the entire program description. In this 
manner, portions of the program descriptions may be compared to one another which provides significantly improved 
results. A set of scenarios are provided to illustrate exemplary comparisons that may be implemented by the filter agent 
600. 

[0268] Each comparison may vary according to any suitable criteria, such as, for example, the following criteria: 

45 

(1) Hierarchy of preferences includes a single branch or multiple branches that are being compared; 

(2) Combinatorial operators such as OR, AND, etc.; 

(3) Composite score versus independent evaluation determines whether all of the individual tests are compiled 
into one composite score for an entire program, or whether one or more branch tests of a program are evaluated 

50 and compared in a relative manner against one or more branch tests of other programs; and/or 

(4) Non-preference indicates a negative preference for a program by appropriate selection of preference values 
and resulting scores. 

[0269] Referring to FIG. 54, the test case for each example may be evaluated as follows: 

55 

(a) In each example, one or more individual preferences 640 may be provided. 

These are designated by lower case 'a', 'b\ 'c\ ... and preferably arranged in a hierarchy. 

(b) The value for each individual preference is the preference value. These are designated as 'PVa', 'PVb\ 'PVc'... 
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(c) One or more programs may have some measurable presence of the desired program feature 644. These are 
designated by upper case 'A', 'B\ 'C\ ... and are related to the corresponding individual preference (in lower case). 

(d) The outcome of each individual preference test 646 is an individual presence value. The individual presence 
value is multiplied by the preference value for that program feature to yield the individual test result. 

5 Any other type of comparison, or other operation, may be used as desired. 

(e) All or part of the resultant individual test results are preferably arranged in the same hierarchy as the preference 
template at block 648. 

(f) The individual test results may be combined at block 650. They may be combined across all levels to create 
one composite score, or they may be combined and evaluated at one or more levels to create sublists at each 

10 parent node. These sublists may be used as the basis for ranking in a manner independent of the ranking for the 

entire program, if desired. Sibling nodes may be combined in operations such as OR, AND, MAX, MIN, AVG, 
SUM,.... Programs with test results that fall below a threshold may be rejected before or after the results are 
combined. 

(g) The ranking of programs at block 652 may be performed by sorting the score results, such as the highest score 
is - ranked number one, etc. at block 654. 

[0270] Referring to FIG. 55, a single (or multiple) branch combining sibling leaf elements (or other elements) using 
an OR'ing combination is illustrated. The result is the same as a traditional OR operation, namely, if either is true then 
the result is true (or score obtained). 
20 [0271] Referring to FIG. 56, a single (or multiple) branch combining sibling leaf elements (or other elements) using 
a MORE IS BETTER combination, which may be implemented as one or more operations, is illustrated. 

Design Rule 1: The OR combinatorial operator means "I want programs with at least one of these listed items." 

Generally, the more that are found, the better and hence more should be ranked above less. 
25 Design Rule 2: The preferred ranking function for the OR combinatorial is the SUM function. 

Design Rule 3: An alternative ranking function for the OR combinatorial is the MAX function. The MAX function 

selects the greatest value, e.g., the most desirable. Other ranking functions may likewise be used. 

[0272] Test case description: If the user agent wants to see programs with dogs(A) or cats(B), then programs with 
30 dogs and cats should rank above programs with just dogs. 

[Test case example illustrated in FIG. 56] 

[0273] 

35 

Test = a OR b 
PVa = PVb = 1 
Program J (A=B=1) 
Program K (A=1,B=0) 

40 

[0274] The test is an OR'ing of individual preference 'a' or 'b', where 'a' and 'b' are testing for the presence of 'A' and 
'B\ Program 'J' has full presence of 'A' and 'B', and Program 'K' has full presence of 'A' and no presence of 'B'. 
[0275] Referring to FIG. 57, a single (or multiple) branch combining sibling leaf elements (or other elements) using 
a JUST SLIGHTLY MORE IS BETTER combination, which may be implemented as one ormore operators, is illustrated. 
45 [0276] Test case description: If the user agent wants to see programs with dogs(A) or cats(B), then programs with 
dogs and only a tiny amount of cats, should rank higher than programs with just dogs. Likewise, Program K would rank 
higher than Program J, if B=0.3 for Program K. 

[Test case example shown in FIG. 57] 

50 

[0277] 

Test = a OR b 
PVa = PVb = 1 
55 Program J (A=1 , B=0.01 ) 

Program K (A=1, B=0) 

[0278] Referring to FIG. 58, a single branch (or multiple) combining sibling leaf elements (or other elements) using 



59 



BNSDCCID: <EP 1189437A2J_> 



EP1 189 437 A2 



10 



15 



55 



a STRONG PREFERENCE IS BETTER, which may be implemented as one or more operators, is illustrated. 
[0279] Design rule 4: If a user agent has a strong preference for something, this should override nominal or weaker 
preferences. 

[0280] Test case description: If the user agent strongly wants to see programs with dogs(A) or nominally wants to 
see cats(B) or mice(C), then programs with dogs should rank above programs with cats and mice. 

[Test case example illustrated in FIG. 58] 

[0281] 

Test = a OR b OR c 
PVa = 4, PVb = PVc = 1 
Program J (A=1, B=C=0) 
Program K (A=0, B=C=1 ) 



[0282] Referring to FIG. 59, a single (or multiple) branch combining sibling leaf elements (or other elements) using 
a RANGE OF PREFERENCE AND PRESENCE YIELDING RANGE OF RANKING, which may be implemented as 
one or more operators, is illustrated. 

[0283] Design rule 5: The evaluation and combination of individual test results should be linear such that partial 
20 preferences and partial presences are ranked in a range from neutral preference/non-presence to full preference/full 
presence. 

[0284] Test case description: If the user agent strongly wants to see programs with bears(A) or nominally wants to 
see lions(B) or tigers(C), then programs with partial bears should rank the same, or higher, or lower than programs 
with full tigers and lions, depending on the preference values. Programs should be ranked linearly, or in any other 
25 manner, according to the PVs and the degree of presence. 

[Test case example illustrated in FIG. 59] 

[0285] 

30 

Test= a OR b OR c 
PVa = 4, PVb = PVc = 1 
Program J (A=0.4, B=C=0) 
Program K (A=0.5, B=C=0) 
35 Program L (A=0, B=C=1) 

Program M (A=0.1, B=C=1) 

[0286] Referring to FIG. 60, a single (or multiple) branch combining sibling leaf elements (or other elements) using 
an AND'ing combination is illustrated. The result is the same as a traditional AND operation, namely, if both are true 
40 then the result is true (or some value). 

[0287] Referring to FIG. 61 , a single branch (or multiple) combining sibling leaf elements (or other elements) using 
a MORE IS BETTER, which may be implemented as one or more operators, is illustrated. 

[0288] Design rule 6: The preferred ranking function for the AND combinatorial is the average function. This takes 
the average of the component test results to create a score that is used for ranking. 
45 [0289] Design rule 7: An alternative ranking function for the AND combinatorial is the minimum function. This takes 
the value of the lowest test result as the score for the combination. 

[0290] Design rule 8: When evaluating the AND combination, as with the OR combination, more preference and 
presence is typically better. 

[0291] Test case description: If the user agent wants to see programs with neural(A) and network(B), then programs 
so W jth full neural and full network should rank above programs with full neural and partial network. 

[Test case example illustrated in FIG. 61] 

[0292] 



Test = a AND b 
PVa = PVb = 1 
Program J (A=B=1) 
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Program K (A=1,B=0.5) 

[0293] Referring to FIG. 62, a single (or multiple) branch combining sibling leaf elements (or other elements) using 
a RANGE OF PREFERENCE AND PRESENCE, which may be implemented as one or more operators, is illustrated. 

5 [0294] Design rule 9: When evaluating the AND combination, as with the OR combination, the individual tests and 
the combination of individual test results should be linear (or any other type) such that partial preferences and partial 
presences are ranked in a range from neutral preference/non-presence to full preference/full presence. 
[0295] Test case description: If the user agent wants to see neural(A) and network(B), then programs with full neural 
and tiny network should rank same, or higher, or lower than programs with partial neural and partial network, depending 

10 on the presence and preference values. 

[Test case example illustrated in FIG. 62] 
[0296] 

15 

Test = a AND b 
PVa = PVb = 1 
Program J (A=B=0.6) 
Program K (A=1, B=0.1) 
20 Program L (A=B=0.5) 

[0297] Referring to FIG. 63, a single (or multiple) branch combining sibling leaf elements (or other elements) using 
a FILTER FIRST VERSUS SCORE FIRST, which may be implemented as one or more operators, is illustrated. 
[0298] Design rule 1 0: The preferred order of operation for the AND combinatorial is score then filter. In this order, 
25 the score for the AND combination is calculated and then if it is below some threshold, the program is rejected. 

[0299] Test case description: If the user agent wants to see artificial(A) and vision(B), then programs with full artificial 
and partial vision should rank above programs with partial artificial and partial vision which rank above programs with 
full artificial and no vision. 

30 [Test case example illustrated in FIG. 63] 

[0300] 

Test = a AND b 
35 PVa = PVb = 1 

Program J (A=1 , B=0.9) 
Program K (A=B=0.9) 
Program L (A=1, B=0) 

40 [0301] Referring to FIG. 64, an alternative single (or multiple) branch combining sibling leaf elements (or other ele- 
ments) using a FILTER FIRST VERSUS SCORE FIRST, which may be implemented as one or more operators, is 
illustrated. 

[0302] Design rule 11 : An alternative order of operation for the AND combinatorial may be filter then score. In this 
order, if a program has zero or less of some AND'd preference, then it is rejected, regardless of the rest of the scoring. 
45 if the score is propagated upward in the hierarchy to be used in other combinatorial operations, then the score should 
indicate neutral or non-preference (e.g. zero or negative value). 

[0303] Test case description: If the user agent wants to see artificial(A) and vision(B), then programs with full artificial 
and no vision should fail. 

50 [Test case example illustrated in FIG. 64] 

[0304] 

Test = a AND b 
55 PVa = PVb = 1 

Program L (A=1, B=0) 

[0305] Container preference elements may be evaluated and combined with other preference elements (either con- 
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tainer, leaf, or otherwise) in a variety of combinatorial operations. 

[0306] Referring to FIGS. 65 and 66, a multiple branch combining sibling leaf elements (or other elements) using a 
MULTI-BRANCH OR'ING, which may be implemented as one or more operators, is illustrated. 
[0307] Design rule 12: The OR combinatorial function implemented by SUM (or other functions) should combine all 
5 the sibling elements (or otherwise) the same, without regard to the number of siblings (or otherwise). 

[0308] User agent rule: If the user agent intends that the ratio of passing preferences should matter, then the agent 
should adjust the preference values accordingly. 

[0309] Design rule 1 3: An alternative ranking function for the OR combination would account for the ratio of passed 
components. 

10 [0310] Test case description: A user agent wants to see movies with as many actors from group-N as possible or as 
many actors from group-M as possible. If N={A,B I C,D) and M={E,F }, then the user agent may wish to see a movie 
with A, B, C ranked over a movie with E, F. 

[Test case example illustrated in FIG. 66] 

15 

[0311] 

Test = x OR y; x = a OR b OR c OR d; y=eORf 
PVx = PVy = PVa = PVb = PVc = PVd = PVe = PVf = 1 
20 Program J (A=B=C=1 , D=E=F=0) 

Program K (A=B=C=D=0, E=F=1) 

[0312] Test case description: A user agent wants to see movies with the highest percentage of actors from group-N 
or the highest percentage of actors from group-M. Illustrated in FIG. 67. If N={A, B, C, D) and M={E, F }, then the user 
25 agent may wish to see a movie with E, F ranked over a movie with A, B, C. 

[Test case example illustrated in FIG. 67] 

[0313] 

30 

Test = x OR y; x = a OR b OR c OR d; y = e OR f 
PVx = PVy = PVa = PVb = PVc = PVd = 1 
PVe = PVf= 2 

Program J (A=B=C=1 , D=E=F=0) 
35 Program K (A=B=C=D=0, E=F=1) 

[0314] Referring to FIGS. 68 and 69, a multiple branch combining sibling leaf elements (or other elements) using a 
COMPOSITE SCORING, which may be implemented as one or more operators, is illustrated. 

[031 5] Design rule 1 4: A preferred method for combining the children test results of a parent element (or otherwise) 
40 is to combine them into one composite score and pass this up to the containing grandparent element (or otherwise), 
as illustrated in FIG. 68. At the root composite element, this composite score is used to rank all the programs. This 
method may be referred to as composite scoring. 

[0316] Test case description: In a composite scoring combination, if the user agent partially wants to see westerns 
(X) that star Eastwood(A) or Wayne(B), or fully wants to see dramas (Y) with a sub-preference that is full for Gibson 
45 (C) or small for Cruise(D), then a western with Eastwood should rank higher than a drama with Cruise. The user agent 
is intending to seek for programs with the highest overall score for all their preferences. 

[Test case example illustrated in FIG. 69] 

50 [0317] 

Test = x OR y; x = a OR b; y = c OR d 
PVx = 0.8, PVy = PVa = PVb = PVc = 1 , PVd = 0.5 
Program J (A=1 , B=C=D=0) 
55 Program K (A=B=C=0, D=1) 

[0318] Referring to FIGS. 70-73, a multiple branch combining sibling leaf elements (or other elements) using a IN- 
DEPENDENT EVALUATION, which may be implemented as one or more operators, is illustrated. 
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[031 9] Design rule 1 5: An alternative method for combining the children test results of a parent element (or otherwise) 
is to rank all the programs for each of the children tests (or otherwise) separately. These sublists of rankings are then 
inserted, as a block, into a super list for the parent element, where each block is ranked according to the preference 
value of the child test. This method may be referred to as independent evaluation. 
5 [0320] Design rule 16: When sublists are inserted into super lists, the position of any program should assume the 
position that the program takes in the highest sublist that contains the program. (Only keep the highest position for 
each program.) 

[0321] Test case description: In an independent evaluation combination, if the user agent partially wants to see 
westerns (X) that star Eastwood(A) or Wayne(B), or fully wants to see dramas(Y) with a sub-preference that is full for 
10 Gibson(C) or small for Cruise(D), then a western with Eastwood should rank lower than a drama with Cruise. The user 
agent is intending to seek for dramas above all westerns. 

[Test case example illustrated in FIGS. 71, 72, and 73] 

15 [0322] 

Test = x OR y; x = a OR b; y=cORd 
PVx = 0.8, PVy = PVa = PVb = PVc = 1 , PVd = 0.5 
Program J (A=1 , B=C=D=0) 
20 Program K (A=B=C=0, D=1) 

[0323] Design rule 17: The OR'ing of sibling container preferences with equal PVs using independent evaluation is 
equivalent to using composite scoring. 

[0324] User agent rule: If the user agent intends to intermingle the ranked results across two branches (or otherwise), 
25 but also intends to rank one branch's results slightly higher than the other (or otherwise), then the agent can use 
composite scoring and adjust the PVs of the leaf tests (or otherwise) of the higher preferred branch to give this slight 
advantage, and the results will still be intermingled. 

[0325] Referringto FIG. 74, a multiple branch combining sibling leaf elements (or other elements) using a COMPAR- 
ING VARIOUS PVS ACROSS HIERACHY, which may be implemented as one or more operators, is illustrated. 

30 [0326] Design rule 18: In AND operations, creating branch sub-lists and then merging these lists should yield the 
same results as creating one composite list. Therefore "independent evaluations" are not relevant. All the components 
of the AND operation should be scored and these results should be combined into a composite score. 
[0327] Test case description: If the user agent is strongly interested in horses(A) or ostriches(B), and nominally 
interested in breeding(C) or grooming(D), then a program with partial horses and full grooming should rank lower than 

35 a program with full horses and partial grooming. 

[Test case example illustrated in FIG. 74] 
[0328] 

40 

Test = x AND y; x = a OR b; y = c OR d 
PVx = 2, PVy = PVa = PVb = PVc = PVd =1 
Program J (A=0.9, B=C=D=1 ) 
Program K (A=B=C=1 , D=0.9) 

45 

[0329] Referring to FIG. 75, a multiple branch combining sibling leaf elements (or other elements) using a UNQUAL- 
IFIED OR'ING OF NON-PREFERENCES MAY RETRIEVE LARGE QUANTITY OF RESULTS, which may be imple- 
mented as one or more operators, is illustrated. 

[0330] Design rule 19: The use of OR combination with non-preferences is a special case that should be used in 
50 conjunction with other AND'd preferences. If the non-preference is OR'd in the main branch, without being further 
qualified with another AND'd preference, this will tend to retrieve the majority of the programs available. OR'ing of non- 
preferences is generally only useful if this branch is qualified with another branch in an AND'ing combination. 
[0331] Referringto FIG. 76, a multiple branch combining sibling leaf elements (or other elements) using a QUALIFIED 
OR'ING OF NON-PREFERENCES IS PREFERRED, which may be implemented as one or more operators, is illus- 
55 trated. 

[0332] Design rule 20: The nature of OR'ing operations is such that individual members of the combination should 
not decrease the combined score, rather, they can only increase the score. When combining n on -preferences in an 
OR combination, the individual test result (negative value) should be translated into the positive preference range by 
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adding the individual preference value to the result. 

[0333] Test case description: If the user agent wants to see programs with "nature"(A) or without "city"(B), then a 
program with nature and city should be ranked lower than a program with just nature. 

5 [Test case example illustrated in FIG. 77] 

[0334] 

Test = a OR b 
10 PVa=1,PVb = -1 

Program J (A=B=1) 
Program K (A=1, B=0) 

[0335] Referringto FIG. 78, amultiplebranch combiningsibling leaf elements (or other elements) using a NON-PREF- 
15 ERENCE SCORE FIRST RESULTS IN ANY PRESENCE YIELDING LOWER RANKING, which may be implemented 
as one or more operators, is illustrated. 

[0336] Design rule 21 : The preferred order of operation for the AND combinatorial of non-preferences is to score 
then filter. In this case, the score for the AND combination is calculated and if the composite score is below zero, the 
program is rejected. 

20 [0337] Design rule 22: When the order of operation for positive preferences is filter-first and the order for non-pref- 
erences is score-first, then the programs are first filtered according to the presence/absence of positive preferences, 
then the score is calculated for all component preferences (positive and negative). This score is then used to again 
filter {reject programs below a threshold) and finally rank the programs. (This design rule is not demonstrated in the 
test cases below.) 

25 [0338] Test case description: If the user agent wants to see programs with "nature"(A) and without "city"(B), then a 
program with just a glimpse of city should pass lower than a program with just nature. 

[Test case example illustrated in FIG. 78] 

30 [0339] 

Test = a AND b 
PVa = 1 , PVb = -1 
Program J (A=1 , B=0.01 ) 
35 Program K (A=1 , B=0) 

[0340] Test case description: If the user agent strongly does not want to see city, then a program with just a glimpse 
of city should fail. 

40 [Test case example illustrated in FIG. 79] 

[0341] 

Test = a AND b 
45 PVa = 1, PVb = -100 

Program J (A=1 , B=0.01 ) 

[0342] Referring to FIG. 80, a multiple branch combining sibling leaf elements (or other elements) using a NON-PREF- 
ERENCE FILTER-FIRST MAY RESULT IN ANY PRESENCE YIELDING REJECTION, which may be implemented as 
50 one or more operators, is illustrated. 

[0343] Design rule 23: An alternative order of operation for the AND combinatorial of non-preferences is to filter-first 
then score. So if a program has the slightest amount of a non-preference, then it is rejected, regardless of the rest of 
the scoring. If the score must be propagated upward to be used in other OR statements, then the score should be zero 
or something negative. 

55 [0344] Test case description: If the user agent wants to see programs with "nature B (A) and without "city"(B), then a 
program with just a glimpse of city should fail. 
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[Test case example illustrated in FIG. 80] 

[0345] 

5 Test = a AND b 

PVa= 1, PVb = -1 

Program J (A=1 , B=0.01 ) 

[0346] The range and number of multimedia content available to users have increased at a staggering rate, rendering 
10 conventional methods of selecting such multimedia content, such as simple browsing, impractical. In order to provide 
a usable technique to select desirable multimedia, typically the system limits the set of choices available to the user 
by constructing and maintaining a user profile, which provides a compact description of a user's interests and personal 
preferences. This user profile is subsequently used to (a) filter input content, so that programs or items that the user 
has shown interest in are presented to the user, and/or (b) request from a content distribution service the programs of 
*5 interest. If desired, the user profile may be specified directly by the user, who explicitly states the descriptions of the 
programs he/she is interested in. Alternatively, user profiles can be automatically generated and updated to match 
content consumption behavior of users by recording and subsequently analyzing usage history information. Further- 
more, content providers (broadcasters and advertisers) can use the usage history information to accurately determine 
consumer response to, and ratings of, specific programs; to provide personalized content to individuals based on their 
20 preferences; and develop various content access, billing, and compensation models for consumers and content cre- 
ators/owners. 

[0347] Existing systems for selecting multimedia content focus on collecting a limited amount of information as part 
of the usage history. The list of available actions and content description items provided by these systems is not suitable 
for extension as new requirements and applications arise. Furthermore, there is a lack of standardized formats for 
25 representation of (multimedia) usage history information; hence the usage history data collected by a certain type of 
device or service cannot often be utilized directly by others. Additionally, usage history has traditionally been considered 
only as a tool for generating user preferences and profiles. Further, existing systems provide no technique to record 
usage history in terms of both individual user actions and detailed categorized lists. 

[0348] After consideration of the existing limitations with respect to the selection of multimedia content the present 
30 inventors developed a system for collecting and describing usage history information of one or more users in a much 
improved manner. The improved system allows non-intrusive observation of user actions over a period of time, enabling 
collection of consumption-related data without explicit user input. The period of time may be specified, modified, or 
otherwise dynamic, as desired. The collected usage history provides a list of the actions carried out by the user over 
a period of time, if desired, as well as statistical information with respect to the content descriptions, if desired. The 
35 content descriptions for the system may be custom for the particular system, standard multimedia descriptions, and 
may be in any format desired. In particular, descriptions may be in the form of a standard description (such as those 
defined by the MPEG-7, TV-Anytime Forum, ATSC-PSIP or DVB-SI) that accompanies the input content. The descrip- 
tions may also be provided as an auxiliary service, such as electronic program guides provided by cable services and 
Internet sites like Gist.com and TVGuide.com. 
40 [0349] The collected usage history information is preferably expressed in a compact, structured, and consistent 
format. These properties permit efficient and effective exchange of usage history information between various devices, 
platforms, applications, service providers, equipment and such, thereby increasing the likelihood of interoperability 
between these entities. 

[0350] The preferred implementation uses a description scheme/XML-based approach for describing the content 
45 usage history of a user over a period of time. The description schemes define a syntax and semantics for specifying 
usage history descriptions; i.e. description schemes for usage histories include a sets of rules to which an individual 
usage history description should comply. Descriptors are generally referred to as attributes of these descriptions. The 
use of a common set of description schemes and descriptors also enables interoperability; that is, different devices 
and systems are able to interpret usage histories that comply with the description schemes. At the same time, descrip- 
50 tion schemes do not need to prescribe completely how an application should use the information embedded in the 
description, for instance, applications are free to process a usage history description in various ways to generate a 
user profile. In all these aspects, the proposed system is different from existing systems, due mainly to the way of 
describing program and usage history information, and to the fact that it provides an exchangeable representation for 
interoperability. Further, the program descriptions may generate usage histories that are rather rich in content and 
55 structure. Furthermore, many different types of information can be associated with individual user actions, thereby 
allowing different implementations to include, configure, and customize the usage history data according to their needs 
without sacrificing interoperability. 

[0351] The present inventors determined that the system should include the concept of a "UserAction," which pro- 
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vides a compact yet generic and flexible representation mechanism for collecting usage history information. The list 
of UserAction types supported by the proposed system may be defined in terms of a thesaurus, and can thus be 
extended relatively easily. The thesaurus can include a diverse set of items and cover a multitude of different applica- 
tions. Thus the system may track a diverse set of user activities such as retail purchases, requests for pay-per-view 
5 programming, and the like. This approach provides a significance improvement over previously introduced methods 
which permit a very limited number of actions to be represented, and which cannot be easily extended to address 
various requirements. 

[0352] The time information associated with each user action may be defined in terms of the general time, which 
denotes the time of occurrence in Coordinated Universal Time (UTC); media time, which denotes the time information 
10 that is encoded with a piece of multimedia content; or both. This functionality allows the time information that is asso- 
ciated with such actions as "fast forward" or "rewind" to be provided accurately. 

[0353] The proposed framework facilitates the handling of a variety of multimedia content types, such as audio and 
video data, web pages, or content that is locally available to the consumer on his/her electronic equipment (e.g. a DVD 
player or Personal Video Recorder (PVR)). This functionality may be enabled by allowing any type of action and all 

15 major content categorization methods to be represented in the history. 

[0354] The system likewise facilitates the referencing functionality to allow different information sources to be asso- 
ciated with a particular action. This aspect of the proposed system allows the reference to a specific part of relevant 
content descriptions (such as the description of a segment that the user reviews in slow motion). Furthermore, content 
material related to an action (such as the content of a hyperlink that the user follows; the web site of a product that the 

20 user purchases; the electronic program guide which provides a listing of the programs available for the given time 
period; etc.) can be explicitly specified by virtue of this property. 

[0355] The system also allows usage history information to be captured at different levels of detail. For example, a 
usage history description can contain a detailed list of all the actions the user has performed over a period of time; or 
basic statistical information according to certain categorizations of content, such as language, country of origin, ratings, 
25 actors, etc.; or both. This flexibility enables even systems with limited resources (e.g. in terms of available storage 
space) to generate standardized, exchangeable usage histories. 

[0356] The structure of the usage history description scheme may be such that the captured usage history information 
can be arranged and presented in multiple different ways, in whatever form is most useful and efficient for the application 
that makes use of the history. For example, the usage history can be organized according to the type of user action 
30 (e.g. all "record" actions), program genre (e.g. usage history for sports programs), or even specific programs (e.g. 
usage history for the program "Seinfeld.") 

[0357] The design may respect a user's privacy by allowing a user to hide his/her identity from being revealed to 
third parties when the usage history is circulated or distributed. 

[0358] The description scheme-XML based framework enables the usage history descriptions to co-exist with other 
35 description schemes (e.g., those that are included in MPEG-7, for example, Usage Preferences Description Schemes) 
in the very same framework. These description schemes allow functionalities to the user so that the user can consume 
the content in ways that fits to his/her preferences, e.g., by consuming programs that are requested on the basis of a 
recurring preferred theme in the usage history. 

[0359] The collected usage history information may be personalized at multiple levels of granularity; i.e. it may be 
40 defined for multiple users (such as an entire family), or for a single user. This feature expedites many applications; for 
example, more detailed program rating information can be collected; parents can track their children's viewing habits 
and more easily control their access to objectionable content; the collected information can be used to generate more 
personalized programming; etc. 

[0360] The system is preferably free from relying on explicit user input; rather, it preferably functions in the background 
45 and collects data without prompting the user to fill out questionnaires or to respond to questions, which are often 
considered intrusive by users. 

[0361] While explicit user input about viewing habits are not required (as noted previously), the system may support 
a configuration layer or tool that the user can manipulate to define the types of activity and/or content information which 
he/she would not want the system to monitor or track. The configuration utility allows the user to provide the system 
50 with a list of activities approved for collection, which implies that the user has the ultimate control over the type and 
extent of the information collected. 

[0362] The preferred context of the system is depicted in FIG. 81 . The usage history process 700 has access to rich 
multimedia content descriptions, which may be available directly from broadcasters 702, electronic program guide 
providers 704, or other services or providers. These descriptions may contain meta data associated with the consum- 
55 able media. A user's actions are monitored and linked to the content descriptions by the usage history module 706. 
The generated usage history information, typically from the usage history module 706, may be utilized by a local profiling 
agent 708 to generate user preferences 710, which may in turn be used by a filtering agent 712 to assist the viewer 
in making program selections. Alternatively, the usage history information may be exchanged with a service provider 
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714, or be transferred to other devices or equipment, where it can be processed in various ways. Since the usage 
history is preferably expressed in a standardized format, exchange of this information is expedited and carried out 
without the need for further translation or modification. Using the configuration utility 71 6 the user can control the types 
of activities and/or content that are monitored, and modify the settings of the usage history and/or client appliance 

5 settings 718. The configuration utility 71 6 enables the user to have control over the information collected about him/her. 
[0363] Referring to FIG. 82, the preferred structure of the proposed system for collection and representation of usage 
history information is shown. The usage history process 700 has access to descriptions of the multimedia content that 
is consumed by the user. These content descriptions are typically generated by one or more commercial providers, 
and may be transmitted with, or separately frorrv the media content. The usage history process also may possess the 

10 ability to monitor the actions that the user performs on a variety of devices, such as A/V equipment, computer terminals, 
set-top boxes, personal digital recorders, and the like. The monitored actions can correspond to switching between 
channels; changing the audio/video settings of a device; recording a program; following a hyperlink; and so on. The 
set of actions monitored and logged by the usage history module is specified by the user through the configuration 
layer 730, which restricts the usage history module to collect information about authorized activities only. When ap- 

15 proved user activity is detected, usage history process 700 records, for the given action, the time of occurrence; the 
duration (if applicable); the unique identification of the program or multimedia content that the action is associated with; 
and/or additional content description information, in the UserActionHistory component 732. Additionally, usage history 
information can be represented as a categorized table using the UserChoiceHistory component 734. To achieve this, 
the content descriptions may be analyzed, and a predefined subset of these descriptions are recorded, preferably in 

20 tabulated format. The number of times a program or multimedia content of certain type (e.g. movies of "Western" genre; 
films that star "Kevin Costner," etc.) is consumed by the user is tracked by the UserChoiceHistory 734, which acts as 
a collection of categorized counters. In general, the usage history 700 and the configuration layer 730 are typically 
processes, whereas the other items typically only contain data. 
[0364] Referring to FIG. 83, the description schemes may be arranged as follows: 

25 

(a) UsageHistory Description Scheme 740; 

(b) UserActionHistory Description Scheme 732; and 

(c) UserChoiceHistory Description Scheme 734. 

30 [0365] The UsageHistory Description Scheme 740 serves as a container description scheme for the UserActionHis- 
tory Description Scheme 732 and the UserChoiceHistory Description Scheme 734, and describes the consumption 
history information for a given period of time. The UserActionHistory Description Scheme 732 contains a list of the 
different types of actions performed by the user during this period of time. The UserChoiceHistory Descriptino Scheme 
734 contains the history of user choices with respect to the descriptions associated with the consumed content. FIG. 

35 83 illustrates an entity-relationship diagram of the structure and interactions of these description schemes 

[0366] The UserActionHistory Description Scheme 732 contains multiple user action lists, each of which may provide 
a temporally ordered log of a specific type of action (such as "record" or "play"). Associated with every action are a 
program identifier that uniquely identifies the program or content for which the action is defined, and also one or more 
action data items, which can refer to any desired piece of the content description. 

40 [0367] The UserChoiceHistory Description Scheme 734 provides a counter-like functionality for categorizing each 
content item based on its description data. The principal purpose of this component of the UsageHistory description 
scheme 740 is to provide an alternative history representation that specifies general statistical information about the 
consumed content. The statistical information may be constantly or periodically updated with the descriptions that 
accompany the content consumed by the user. This representation is especially useful for long observation periods, 

45 when the storage requirements for logging of all user actions and (if desired) associated meta data may become 
inhibiting for some systems. The UserChoiceHistory Description Scheme 734 may include multiple parts, such as for 
example, five different parts: 

The Classification history 742, may provide a history of the user's choices with respect to classification descriptions 
50 of content; such as genre, language, and country of origin; 

The Creation history 744, may provide a history of the user's choices with respect to creation descriptions of 
content, such as title 746, location 74B, creator 750, date 752, and actor/director. 

[0368] The Source history 754, may provide a history of the user's choices with respect to source (such as its publisher 
55 or distributor) and format (such as coding type) descriptions of the content. 

[0369] The Summarization history 756, may provide the summarization-associated viewing/consumption history for 
a user. 

[0370] The Keyword history 758, may provide a history of the keywords the user has used while viewing/consuming 
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content. 

[0371] The UserChoiceHistory Description Scheme 734 preferably shares a similar structure with UsagePreferences 
description scheme in MPEG-7 described in MPEG-7, ISO/IEC CD 1 5938-5 Information Technology - Multimedia Con- 
tent Description Interface - Part 5 Multimedia Description Schemes, N3705, La Baule, France, October 2000, incor- 

5 porated by reference herein. The principal motivation for the similarity is to simplify mapping of the available history 
information to the standardized user preference description. However, this similarity is not intended to constrain the 
set of applications for user choice history, since the information in the description is generic and flexible enough to be 
utilized in anyway desirable. Moreover, the UserChoiceHistory may be structured in any suitable manner. 
[0372] It should be understood that the UsageHistory Description Scheme 740 serves as a structure to link all the 

10 pieces of information together. Various scenarios in different application environments exist in which not ail the various 
parts of the UsageHistory Description Scheme 740 are provided together in one description, but in other cases they 
may be. For example, in some cases only the UserActionHistory Description Schceme may be instantiated, or parts 
thereof, whereas in other only UserChoiceHistory Description Scheme might be utilized to describe the usage history, 
or parts thereof. Also, different descriptions may share description parts through the use of identifiers and identifier 

15 references. Different parts of the scheme proposed may exist in standalone descriptions. 

Usage History Description Scheme Syntax and Semantics 
[0373] 
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<!— ################################################ "> 
<!-- Definition of UsageHistory DS 



< 



> 



so <element name="UsageHistory M type= ,, mpeg7:UsageHistoryType , 7> 

<complexType name= ,, UsageHistoryType ,l > 
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<complexContent> 

<extension base= n mpeg7:DSType"> 

<sequence minOccurs-'O" maxOccurs= f, r , > 
<element name="UserIdentifier" 

type="mpeg7:UserIdentifierType" 
minOccurs= M 0" maxOccurs="r'/> 
<element name="UserActionHistory" 

type^'mpegTiUserActionHistoryType" 
minOccurs-'O" maxOccurs- 'unbounded M /> 
<element narne =fT UserChoiceHistory M 

type^'mpegTiUserChoiceHistoryType" 
minOccurs= M 0 M maxOccurs== M unbounded'7> 

</sequence> 

<attribute name="allowCollection rt type- 'boolean" 
use="default" value="true"/> 

</extension> 
</complexContent> 
</complexType> 

<!-- ################^ -> 
<!— Definition of UserActionHistory DS — > 

<!-- » ao»^ --> 
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<element name= ,, UserActionHistory M type="mpeg7:UserActionHistoryType7> 

<complexType name="UserActionHistoryType M > 
<complexContent> 

<extension base="mpeg7:DSType"> 

<sequence minOccxirs- '0 M maxOccurs= ,, r'> 
<element name- 'ObservationPeriod" 

type-'mpeg7:TimeType" 
minOccurs-' 1 " maxOccurs-'unbounded"^ 
<element name="UserActionList" 

type =,, mpeg7 :UserActionListType" 
minOccurs- ' 1 " maxOccurs =,l unbounded7> 

</sequence> 

<attribute name^'protection" type- 'boolean" 
use="default" value="true7> 

</extension> 
</complexContent> 
</complexType> 

<!-- Definition of UserActionList DS --> 

<!„ ################^ -> 
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<element name="UserActionList" type="mpeg7:UserActionListType'V> 

<cornplexType name="UserActionListType"> 
<complexContent> 

<extension base="mpeg7:DSType"> 

<sequence minOccurs="0" maxOccurs= ,, r , > 
<element name="ActionType" 

type= ,, mpeg7:Cont^olledTe^nType l, 
minOccurs= M 1 " maxOccurs=" 1 "/> 
<element name-'UserAction" 

type="mpeg7 :UserActionType" 
minOccurs-'O" maxOccurs="unbounded"/> 

</sequence> 

<attribute name= f, numlnstances" type^'nonNegativelnteger" 

use =,, optionar7> 
<attributc name="totalDuration ,, type='*mpeg7:durationType" 

use="optional7> 

</extension> 
</complexContent> 
</complexType> 

<!-- Definition of UserAction DS — > 
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<element name^UserAction" type="mpeg7:UserActionType7> 



<complexType name- TJserActionType"> 
is <complexContent> 

<extension base- , mpeg7:DSType"> 

<sequence minOccurs-'O" maxOccurs= n l"> 

<element name-^ctionTime'' minOccurs="0 M 

maxOccurs=" 1 M > 

<complexType> 

<sequence minOccurs="0" maxOccurs- ' 1 "> 
<element name j = l, ActionMediaTime" 



type="mpeg7:MediaTimeType ,, 
35 minOccurs="0" 

maxOccurs='T7> 

<element 

narae-'ActionGeneralTirne" 
typ e-'mpeg7 :TimeType" 
minOccurs^'O" 

maxOccurs= r, l ,f /> 

</sequence> 
</complexType> 

55 
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</element> 

<element name^'Trograjnldentifier'' 

type="mpeg7 :UniqueIDType'' 
minOccurs=" 1 " maxOccurs= M 1 "/> 
<elernent name-'ActionDataltem" 

type= ,, mpeg7:ReferenceType" 
minOccurs- '0 M maxOccurs= M unbounded ,, /> 

<ysequence> 
<textension> 
</complexContent> 
</complexType> 

<!-- Definition of UserChoiceHistory DS — > 

<- umiiiiiitiiiiiiiitiiiiiiiiiiiiiMMminiiiiinii!^ -> 

<element name= n UserChoiceHistory M type- , mpeg7:UserChoiceHistoryType , 7> 

<cornpIexType name="UserChoiceHistoryType"> 
<complexContent> 

<extension base="mpeg7:DSType"> 
<sequence minOccurs-T' maxOccurs= ,, l"> 

<element name- 'ObservationPeriod" type= ,t mpeg7:TimeType n 
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minOccurs= n l " maxOccurs= ,, unbounded"/> 
<element name= M ClassificationHistory n 
type="mpeg7:Classif]cationHistoryType" 
minOccurs="0" maxOccurs= M l"/> 

<element name=XreationHistory" type^'mpegTiCreationHistoo'Type" 
minOccurs= M 0 n maxOccurs =, T7> 

<element name="SourceHistory M type- 'mpeg7:SourceHistoryType n 
mmOccurs= ,, 0 M max0ccurs- ,, 17> 
<element name="SummarizationHistory" 
type="inpeg7: SummarizationHistoryType n 
minOccur-s= w CT maxOccurs- TV> 

<element name="KeywordHistory" type="mpeg7:KeywordHistoryType" 
minOccurs="0" maxOccurs="l"/> 
</sequence> 

attribute name= f, numTotalInstances" type="nonNegativeInteger"/> 

<attribute name^'protection" type="boolean" use="default H 
value="true"/> 
</extension> 
*«s/complexContent> 
</complexType> 



<!-- Definition of ClassificationHistory DS — > 
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<!-- 
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<complexType name= n ClassificationHistoryType ,, > 
<complexContent> 

<extension base="mpeg7:DSType"> 
<sequence minOccurs-T' maxOccurs- T'> 

<element name="CountryHistory" minOccurs-'O" 
maxOccuis= M unbounded"> 



<extension base="mpeg7:IS031 66- lCountryCode"> 

<attribute name-'numlnstances" type= ,, nonNegativeInteger*V> 
<attribute name^'totalDuration" type="mpeg7:durationType"/> 



</complexType> : r •* 

</element> 

<element name= ,, ReleaseDateHistory ,, minOccurs="0" 
r maxQccnf s=" unboxinded'^ 
<complexType> 
<extension base= f, nipeg7 :TimeType"> 

<attribute name-'numlnstances" type="nonNegativeInteger"/> 
<attribute name= ,, totalDuration ,f type- , mpeg7:durationType7> 



<complexType> 



<attribute name="id" type= H ID"/> 



</extension> 




<attribute name="id" type="ID , 7> 
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</extension> 
</complexType> 
</element> 

<element name="LanguageHistory" minOccurs= M 0" 
maxOccurs="unbounded"> 
<complexType> 
<extension base="LanguageType"> 

<attribute name= !, numlnstances n type= ,, nonNegativeInteger"/> 
<attribute name="totarDuration" type="mpeg7:durationType'7> 
<attribute name="id" type="ID'7> 
</extension> 
</complexType> 
</element> 

<element name="GenreHistory" minOccurs="Cr 
maxOccurs= H unbounded n > 

<complexType> 
<extension base^mpegTiGenreType" > 

<attribute name= ,f numInstances M type="nonNegativeInteger7> 
<attribute name= ,, totalDuration" type="mpeg7:durationType ,, /> 
Attribute name= ,, id n type="ID , V> 
</extension> 
</complexType> 
</element> 



76 



1189437A2_I_> 



EP 1 189 437 A2 

<element name= n MediaReviewHistory M minOccurs="0" 
maxOccurs="unbounded"> 
<complexType> 

<sequence minOccurs-T' maxOccurs=" 1 M > 
<element name="Reviewer" 

type= M mpeg7:PersonType7> 
<element name="RatingCriterion"> 
<complexType> 

<sequence minOccurs^" 1 " 
maxOccurs= ,f 1"> 
<element 

name="CriterionName !, 
type= M mpeg7:TextuaIType7> 
<element 

name^'WorstRating" 
type="integer7> 

<element name= n BestRating" 
type="integer7i> 

</sequence> 
</complexType> 
</element> 

<e]ement name- 'RatingValue" 

maxOccurs-'unbounded M > 
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<complexType> 

<simpleContent> 
<extension base-'integer 1 ^ 

<attribute name- 'numlnstances" 

type= M nonN egati velnteger7> 
<attribute name= ,, totalDuration M 

type="mpeg7 :durationType7> 
</extension> 
</simpleContent> 
</compIexType> 
</element> 
</sequence> 

<attribute name="numlnstances" type="nonNegativeInteger"/> 
<attribute name="totalDuration M type= M mpeg7:durationType ,, /> 

<attribute name="id" type="ID7> 
</compIexType> 
</e!ement> 

<element name^'TarentalGuidanceHistory" minOccurs- 'O" 

maxOccurs= , 'unbounded M > 
<cornplexType> 
<extension base="mpeg7:ParentalGuidanceType"> 

<attribute name- 'numlnstances" type= n nonNegativeInteger7> 
<attribute name="totalDuratiorr type="mpeg7:durationType'7> 
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attribute name="id M type="ID7> 
</extension> 
</complexType> 
</element> 
</sequence> 

<attribute name- 'numTotallnstances" type="nonNegativeIiiteger7> 
</extension> 
</complexContent> 
</compIexType> 

<!- Definition of CreationHistory DS — > 

<!-- mmmmmmmmmmmmmmmmmmmmm ^ -> 

<complexType name= M CreationHistoryType"> 
<complexContent> 

<extension base=="nipeg7:DSType"> 
<sequence minOccurs-T 1 maxOccurs="r'> 

<element name- Tit leHistory" minOccurs="0 n 

maxOccurs =,, unbounded M > 

<comp!exType> 
<extension base =,, mpeg7:TitleType"> 

<attribute name="numlnstances" type="nonNegativeInteger7> 
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<attribute name^totalDuration" type= , 'mpeg7:durationType'7> 
<attribute name= f, id H type= ,r ID , 7> 
</extension> 
</complexType> 
</element> 

<element name= n CreatorHistory" minOccurs="0" 
maxOccurs="unbounded n > 

<complexType> 
Extension base= M mpeg7:CreatorType n > 

<attribute name-'numlnstances" 

type="nonNegativeInteger , 7> 
<attribute name- UotalDuration" 

type-'mpeg7:durationType7> 
<attribute name^id" type="ID7> 
</extension> 
</complcxType> 
</element> 

<element name : ="LocatioTiHistory f1 minOccurs="0" 
maxOccurs="nnbounded ,, > 

<complexType> 
<extension base ss "mpeg7:PIaceType"> 

<attribute name="numlnstances" 
type-'nonNegativeInteger7> 



11 89437 A2_l_> 



80 



EP 1*1-89 437 A2 

<attribute name="totalDuration" 
type= M mpeg7:durationType7> 

</extcnsion> 
</complexType> 
</element> 

<element name : = ,, DateHistory ,, minOccurs- '0" 
maxOccurs= M unbounded H > 

<complexType> 
<extension base="mpeg7:TimeType M > 

<attribute name- 'numlnstances" 

type="nonNegativeInteger7> 
<attribute name- 'totalDuration" 

type="mpeg7 : durationTyp e"/> 
<attribute name="id" type="ID7> 
</extension> 
</complexType> 
</element> 
</sequence> 

<attribute name="numTotalInstances" type="nonNegativeInteger7> 
</extension> 
</complexContent> 
</complexType> 
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<!-- Definition of SourceHistory DS ~> 
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<complexType name="SourceHistoryType"> 
<coraplexContent> 

<extension base= M mpeg7:DSType M > 
20 <sequence minOccurs= M 1 maxOccurs- 1 1 M > 

<element name-TublicationTypeHistory" minOccurs="0" 
maxOccurs="unbounded"> 
<complexType> 
<extension base="mpeg7:ControlledTerraType ,, > 

<attribute name="numlnstances" type="nonNegativeInteger"/> 
<attribute name="totalDuration" type="mpeg7:durationType"/> 
attribute name^'id" type="ID M /> 
</extension> 
</complexType> 
</element> 

<elenient name- TublicationSourceHistory" minOccurs="0" 
maxOccurs= ,, unbounded t, > 
<complexType> 

<simpleContent> 

55 <extension base= n string"> 
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<attribute name= H numInstances" 

type= M nonNegativeInteger7> 
<attribute name-'totalDuration" type="mpeg7:durationType7> 
<attribute name="id" type="ID7> 
</extension> 
</simpleContent> 
</complexType> 
</element> 

<element name- TublicationPlaceHistory" minOccurs- *0 M 
maxOccurs= ,, unbounded"> 
<complexType> 
<extension base- , mpeg7:PlaceType"> 

<attribute narne-'numlnstances" type= ,, nonNegativeInteger' , /> 
<attribute name- 'totalDuration" type="mpeg7:durationType , V> 

</extension> 
</complexType> 
</element> 

<element name="PublicationDateHistory" minOccurs="0" 

maxOccurs="unbounded , '> 
<complexType> 
<extension base- , mpeg7:TimeType"> 

<attribute name="numlnstances" type="nonNegativeInteger7> 
<attribute name- 'totalDuration" type="mpeg7:durationType M /> 
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attribute name= ,, id M type="ID7> 
</extension> 
</complexType> 
</element> 

<element name-TublisherHistory" minOccurs- f 0" 
maxOccurs="unbounded"> 
<complexType> 
<extension base= ,l mpeg7:AgentType ,, > 

<attribute name-'numlnstances" type= f, nonNegativeInteger"/> 
<attribute name="totalDuration" type- 'mpeg7:durationType'7> 
</extension> 
<complexType> 
</element> 

<element name= w PublicatioriForniHistoiy" minOccurs="0" 

maxOccurs="unbounded ,, > 

<complexType> 

<attribute name= ,, formType ,, > 
<simpleType> 

Restriction base= w string ,, > 

Enumeration value="payPerVie\v"/> 
<enumeration value="payPerUse7> 
<enumeration value- 1 live'7> 
<enumeration value—'repeat"^ 
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</restriction> 
</simpleType> 
</attribute> 

<attribute name-'numlnstances" type="nonNegativeInteger7> 
<attribute name-'totalDuration" 
'njpeg7:durationType7> 

<attribute name="id" type="ID7> 
</complexType> 
</element> 

<element name^'MediaFormatHistory" minOccurs^'O" 
maxOccurs="unbounded ,, > 

<sequence minOccurs="l" maxOccurs=" 1 "> 

<choice minOccurs-' 1 " maxOccurs- 1 1 M > 
<element name^TileFormat" 

type="mpeg7;ControlledTennType"/> 
<element name="Medium" type= f, mpeg7:ControlledTermType n /> 
<element name="System" type= ! 'inpeg7:ControlledTermType7> 
<element name= M VisualCodingFormat" 
type=="mpeg7:ControlledTermType"/> 
<element name= lf AspectRatio" 

type= M mpeg7:ControIIedTennType , 7> 
<element name= M Color" t>pe="mpeg7:ControlledTermType'V> 
<element nanie= n AudioCodingForrnar 
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type="mpeg7:ControlledTerniType , V> 

<element name="AudioPresentation M 
type= n mpeg7:ControlledTermType7> 
</choice> 
</sequence> 

<attribute name- ? numlnstances" type= ,, nonNegativeInteger , V> 
<attribute name= !, totalDuration" type- ,, mpeg7:durationType , 7> 

<attribute name= r, id" type="ID , V> 
</element> 
</sequence> 

<attribute name- 'numTotallnstances'' type^'^onNegativelnteger"^ 
</extension> 
</complexContent> 
</coniplexType> 

<complexType name =,, KeywordHistoryType"> 
<complexContent> 

<extension base= M mpeg7:DSType ,# > 
<sequence minOccurs-T' maxOccurs=" 1 "> 

<eleraent name="KeywordItem" minOccurs="0" maxOccurs- 'unbounded'^ 
<comp lexType> 

<extension base~'mpeg7:KeywordAniiotationType' , > 

<attribute name="numlnstances" type= n nonNegativeInteger n /> 
<attribute name="totalDuration" type- , mpeg7:durationType7> 
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<attribute name^'id" type="ID'7> 
</extension> 
</complexType> 
</element> 
</sequence> 

<attribute name="numTotalInstances M type= tt nonNegativeInteger ,, /> 
</extension> 
</complexContent> 
</complexType> 

<!-- mmmmmmmmmmmmmmmmmmm 

<!-- Definition of SummarizationHistory DS — > 

<!- ########^^ --> 

<complexType name="SummarizationHistoryType"> 
<cornplexContent> 

<extension base="mpeg7:DSType"> 
<sequence minOccurs="l" maxOccurs— 1 ! 1"> 

<element name- 'SummaryComponentHistory" min Occurs- f 0" 
rnaxOccurs="unbounded"> 
<complexType> 
<extension base="mpeg7:SummaryComponentType n > 

<attribute name= f 'nunilnstances" type= M nonNegativeInteger ,f /> 
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<attribute name= M totalDuration" type= ,1 mpeg7:durationType , V> 
<attribute name^'id" type="ID"/> 
</extension> 
</complexType> 
</element> 

<element name- 'Summary ThemeHi story" minOccuTS-'O" 
maxOccurs=="unbounded"> 
<complexType> 
<extension base- f mpeg7:TextualType M > 

<attribute name="numTotalInstances" type="nonNegativeInteger'V> 
<attribute name— 'protection" type^'boolean" use="default'7> 
attribute name="id" type= ,, ID ,, /> 
</extension> 
</complexType> 
</element> 

<element name="SummaryDurationHistory" minOccurs="0" 
maxOccurs=="unbounded f, > 
<complexType> 

<sequence minOccurs=" 1" maxOccurs= ,f i "> 
<element name="MinNumSunimaryFranies" 
type="nonNegativeInteger" 
minOccurs= ,f 0" maxOccurs='TY> 
<element name- 'MinSummaryDuration" 



88 



EP1 189 437 A2 



type~"mpeg7:durationType" 
minOccurs^'O" maxOccurs='T7> 
<element naine="MaxNumSiimmaiyFrames" 
type= n nonNegativeInteger M 
minOccurs^'O" maxOccurs=='T7> 
<element name- 'MaxSummaryDuration" 
type="mpeg7:durationType ,f 
minOccurs= ,, 0 H maxOccur5="17> 
<element name= ,t AvgNumSumma^yFrames ,, 
type="nonN egati velnteger" 
minOccurs-'O" maxOccurs='T7> 
<element name="AvgSum^la^yDll^ation ,l 

type^'mpegyidurationType" 
minOccurs-'O" maxOccurs- '17> 
</sequence> 

<attribute name^'id" type="ID7> 
</complexType> 
</element> 
</sequence> 

<attribute name^numTotallnstances'' type="nonNegativeInteger7> 
</extension> 
</cornplexContent> 
. </complexType> 
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UsageHistory DS Semantics 


Name 


Definition 


UsageHistoryType 


Specifies user's multimedia content consumption history. 


Userldentifier 


Identifies the individual for whom the usage history is provided specifies. This element is of 
type UserldentifierType, which is described as part of UserP references DS, and contains the 
protected attribute. Thus the identity of the user is not disclosed unless this attribute is set to 
false. 


UserActi on History 


Describes history of the actions the user has carried out during the observation period. See 
below for the specification of UserActionHistoryType. 


UserChoiceHistory 


Describes a categorized listing of the set of choices the user has made while consuming 
content. See below for the specification of UserChoiceHistoryType. 


AllowCollection 


This attribute specifies whether the user allows his/her usage history data to be collected. 
The default value for allowCollection attribute is true. 



UserActionHistory DS Semantics 


Name 


Definition 


UserActionHistoryType 


Specifies a history of the actions carried out by the user. 


ObservationPeriod 


Describes the time period(s) during which the associated history items have been 
recorded. Multiple instance of ObservationPeriod can be used to represent discontinuous 
time periods 


UserActionList 


Describes a list of actions of the same type, i.e. all actions in the UserActionList carry 
the same ActionType value. See below for the specification of UserActionListType. 


protection 


This attribute specifies whether the given UserActionHistory information is to be 
protected from 3rd parties, or disclosed freely. 



35 


UserActionList DS Semantics 




Name 


Definition 




UserActionListType 


Specifies a list of user actions, with associated semantics. 


40 


ActionType 


Denotes the specific action performed by the user, such as "View, 0 "Pause," "Play," etc. 
Defined as ControlledTermType to provide extensibility for different actions. All actions in an 
ActionList have the same ActionType. A list (dictionary) of possible values of ActionTypeis 
shown below. 


45 


UserAction 


Characterizes a specific user action in the list. Each action is associated with a single 
program. See the specification of UserActionType shown below for more detail. 


num Instances 


Specifies the number of UserAction elements in a UserActionList. (e.g. 21 "Record" actions; 
5 "View" actions; etc.). 


50 


totalDu ration 


he total time spent by the user during the observation period performing a specific action (e. 
g. 32min for "Record" action). 



[0374] A list of actions that define an exemplary set of values for ActionType elements is shown. Each term has a 
numeric identifier, listed in the first column, and a textual label, listed in the second column. A description of each term 
is listed in the third column. This is one example of a thesaurus for ActionType of usage history. 



Term 


Label 


Description 


1 


Audio-Video 


Actions Related to Audio and Video 
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(continued) 





Term 


Label 


Description 


5 


1.1 


PlayRecording 


Play content from a recording 




1-2 


PlayStream 


Play content from input stream 




1.3 


Record 


Record input stream to local storage media 




1.4 


Preview 


View or listen to a summary of the input stream 


10 


1.5 


Pause 


Pause the input stream 




1.6 


FastForward 


Fast forward the input stream 




1.7 


Rewind 


Rewind the input stream 


15 


1.8 


Skip Forward 


Skip forward over a portion of the input stream 


1.9 


SkipBackward 


Skip backward over a portion of the input stream 




1.10 


Mute | 


Turn sound off 




1.11 


VolumeUp 


Increase volume 


20 


1.12 


VolurneDown 


Reduce volume 




1.13 


Loop/Repeat 


Repeat/loop (part of) the input stream 




1.14 


Shuffle 


Randomly select next track 


25 


1.15 


SkipToEnd 


Go to the beginning of the stream 




1.16 


SkipToStart 


Go to the end of the stream 




1.17 


CopyCD 


Copy all or part of a CD 




2 


Video 


Actions Related to Video 


30 


2.1 


Zoom 


Zoom (in) to the on-screen image or sequence 




2.2 


SlowMotion 


View input stream in slow motion 




2.3 


CCOn 


Closed caption is on 


35 


2.4 


Step Forward 


Advance to next frame 




2.5 


Step Backward 


Return to previous frame 




3 


Data 


Actions Related to Miscellaneous Data 




3.1 


ClickThrough 


Follow an available link 


40 


3.2 


ScrollUp 


Scroll up in a web page/composite page 




3.3 


ScrollDown 


Scroll down in a web page/composite page 




3.4 


ViewGuide 


View program/resource guide 


45 


3.5 


SavePage 


Save web page/composite page 




3.6 


PrintPage 


Print web page/composite page 




3.7 


Search 


Search the web or local resources 


50 


3.8 


SubmitForm 


Submit a form with requested information 


3.9 


SubmitQuery 


Submit a query 




3.10 


Archive 


Archive content to persistent local storage media 




4 


Commerce 


Actions Related to Commerce 


55 


4.1 


Buy 


Purchase a product or item 




4.2 


AddToWishList 


Designate a product or item as possible future purchasing item 




4.3 


AddToCart 


Designate a product or item as potential immediate purchase item 
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UserAction DS Semantics 


Name 


Definition 


UserActionType 


Specifies a detailed description of an action, with associated semantic. 


Actio nTime 


Specifies the time that the action took place and, if applicable, its duration (e.g. for "Play," 
Pause, etc.). The time of occurrence ot tne action can ue aescriDea in iwo wdyb. uy 
ActionMediaTime and/or by ActionGeneralTime. The duration of a UserAction refers to the 
duration in terms of the media time, which is identical to the duration in UTC for a large 
number of action types, but may be different for such action types as "Repeat" or 
"Fast Forward." 


ActionMediaTime 


Action time relative to the time reference established for the given media. This time 
referencing method is useful for such action items as "Repeat" or "FastForward," and for 
manipulating conteni on me users loudi system \buun eit> pei&vjnai ui wvuo/. 


ActionGeneralTime 


Action time relative to Coordinated Universal Time (UTC) in the Gregorian date/time format. 


Programldentifier 


Unique identifier of the program that is associated with the given action. Each Action is 
associated with a single program and, consequently, a single Programldentifier. 


ActionDataltem 


Refers to a specific part of the description of the AV content, or to other material related to 
the action (e.g. the URL the user chooses to follow in an enhanced TV application). Shall 
refer to an id attribute of an AV content description instance. 



UserAction DS Semantics 


Name 


Definition 


UserActionType 


Specifies a detailed description of an action, with associated semantics. 


ActionTime 


Specifies the time that the action took place and, if applicable, its duration (e.g. for "Play," 
"Pause," etc.). The time of occurrence of the action can be described in two ways: by 
ActionMediaTime and/or by ActionGeneralTime. The duration of a UserAction refers to the 
duration in terms of the media time, which is identical to the duration in UTC for a large 
number of action types, but may be different for such action types as "Repeat" or 
"FastForward." 


ActionMediaTime 


Action time relative to the time reference established for the given media. This time 
referencing method is useful for such action items as "Repeat" or "FastForward," and for 
manipulating content on the user's local system (such as personal CDs or DVDs). 


ActionGeneralTime 


Action time relative to Coordinated Universal Time (UTC) in the Gregorian date/time format. 


Programldentifier 


Unique identifier of the program that is associated with the given action. Each Action is 
associated with a single program and, consequently, a single Programldentifier. : 


ActionDataltem 


Refers to a specific part of the description of the AV content, or to other material related to 
the action (e.g. the URL the user chooses to follow in an enhanced TV application). Shall 
refer to an id attribute of an AV content description instance. 



UserChoiceHistory DS Semantics 


Name 


Definition 


UserChoiceHistoryType 


Specifies a categorized listing of the set of choices the user has made while consuming 
content. 


Observation Period 


Describes the time period(s) during which the associated history items have been 
recorded. Multiple instance of Observation Period can be used to represent 
discontinuous time periods. 
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(continued) 



UserChoiceHistory DS Semantics 


Name 


Definition 


ClassificationHistory 


Describes history of the user's choices with respect to classification descriptions of 
content. 


Creation History 


Describes history of the user's choices with respect to creation descriptions of content. 


Source History 


Describes history of the user's choices with respect to source descriptions of the content, 
such as its publisher or distributor. 


KeywordHistory 


Describes history of the keywords the user has used while viewing/consuming content. 


SummarizationHistory 


Describes history of the media summaries the user has consumed during the 
observation period. 


numTotal Instance 


Total number of content items/descriptions considered in UserChoiceHistory 


protection 


This attribute specifies whether the given UserChoiceHistory information is to be 
protected from 3rd parties, or disclosed freely 



20 





ClassificationHistory DS Semantics 




Name 


Definition 


25 


Classification History-Type 


Specifies history of the user's choices with respect to classification descriptions of 
content. 


30 


Country History 


Describes usage history with respect to country of origin of the content. The attributes 
numlnstances and totalDuration specify usage statistics for a specific country 
instance. 


ReleaseDateH istory 


Describes usage history with respect to date of release of the content. The attributes 
numlnstances and totalDuration specify usage statistics for a specific release date 
instance. 


35 


LanguageHistory 


Describes usage history with respect to language of the content. The attributes 
numlnstances and totalDuration specify usage statistics for a specific language 
instance. 




GenreHistory 


Describes usage history with respect to genre of the content. The attributes 
numlnstances and totalDuration specify usage statistics for a specific genre instance. 


40 


SubjectHistory 


Describes usage history with respect to subject of the content. The subject classifies 
content from a point of view of types of program, without considering genre 
classification. The attributes numlnstances and totalDuration specify usage statistics 
for a specific subject instance. 


45 
50 


MediaReviewHistoryType 


A data type for description of usage history with respect to reviews of the content, with 

associated semantics: 

Reviewer: Reviewers/critics of the content. 

RatingCriterion: The rating criterion used in the review. RatingCriterion includes 
CriterionName and the rating scale (form WorstRating to BestRating 
RatingValue: The rating value assigned to the content, num I nstances/totalDu ration- 
Describe the usage statistics associated with the given RatingValue. 
numlnstances/totalDuration: Describe the usage statistics associated with the given 
MediaReviewHistory. 

id: ID of the MediaReviewHistory instance. 


55 


ParentalGuidance-History 


Describes user history with respect to parental guidance ratings of the content. The 
attributes numlnstances and totalDuration specify usage statistics for a specific 
parental guidance instance. 
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(continued) 



ClassificationHistory DS Semantics 


Name 


Definition 


numTotallnstances 


Total number of content items for which ClassificationHistory information has been 
generated. 



10 


SourceHistory DS Semantics 




Name 


Definition 


15 


SourceHistoryType 


Specifies history of the user's choices with respect to source (such as its publisher or 
distributor) and format (such as coding type) descriptions of the content. 


PublicationTypeHistory 


Describes usage history witn respect to publication meaium ot comeni, sucn as 
satellite broadcast, CD, etc. The attributes numlnstances and totalDuration specify 
usage statistics for a specific publication type instance. 


20 


PublicationSource-History 


Describes usage history with respect to publication source of the content, such as 
broadcast channels. The attributes numlnstances and totalDuration specify usage 
statistics for a specific publication channel instance. 


25 


PublicationPlace-History 


Describes usage history with respect to the place content is distributed from. The 
attributes numlnstances and totalDuration specify usage statistics for a specific 
publication place instance. 


PublicationDateH istory 


uescnDes usage nistory wim respect 10 ume/uaie oi corueru □isLriL/uuon. \ am iL>uit;t> 
numlnstances and totalDuration specify usage statistics for a specific publication date 
instance. 


30 


PublisherHistory 


Describes usage history with respect to publisher or distributor of content. The 
attributes numlnstances and totalDuration specify usage statistics for a specific 
publisher instance. 


35 


PublicationFormHistory 


Describes usage history with respect to the form in which content in available. The 
attributes numlnstances and totalDuration specify usage statistics for a specific 
publication form instance. 


40 


MediaFormatHistory 


Describes usage history with respect to format of the content. The format specifiers 
considered are file format; content medium; system; video and audio coding format; 
aspect ratio; color appearance; and audio presentation. The attributes numlnstances 
and totalDuration specify usage statistics for a specific media format instance. 


numTotallnstances 


Total number of content items for which SourceHistory information has been 
generated. 



Key wordH istory DS Semantics 


Name 


Definition 


KeywordH istoryType 


Specifies history of the keywords the user has used while viewing/consuming content. 


Keywordltem 


Describes a specific keyword instance, and the statistical information associated with it 
(through numlnstances and totalDuration attributes 


numTotallnstances 


Total number of content items for which KeywordH istory information has been generated 
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10 



15 



20 



25 



30 



35 



SummarizationHistory DS Semantics 


Name 


Definition 


SummarizationHistory-Type 


Specifies summarization-associated viewing/consumption history for a user. 


SummaryComponent-History 


Describes the list of summary components consumed by the user (such as 
keyFrames, keyVideoClips, etc.) and the statistics associated with each, as defined 
by. 


SummaryThemeHistory 


Describes the list of textual themes associated with the summaries consumed by 

t ho near ac \Afoll ac tho ctsiticti/^c ff*r oaoh thomo ac Hofinori h\/ tho attrihiitoc 
Li Its Uocl, cto Well do lllc oLalloLlOo IVJI cdLI 1 lilt;/ lit:, do UtflllltrU Uy Lllc oLLIIUulco 

numlnstances and totalDuration. 


O i imman/ni iration Wict^rxi 

ouninidiyLJUfaMunpiioiuiy 


nocf rihoc tho Hi i rot i r\ r~\ i nf ArmutiAn f r\ r tho c 1 1 mm arioc pnnci i m orl hi/ tho 1 1 cor I h o 
UcoLI lUco lllc UUIdLlUil II IIUl 1 HaLlvJII IUI lilt? o Uf [ III 1 dl Ico UUIIbUlllcU Uy Lllc Uocf, } Ilcf 

associated elements of this complex type are: 


MinNumSummary Frames 


Size of the shortest/smallest key frame-based summary consumed by the user 
(stated in number of frames). 


MinSummaryDuration 


Temporal duration of the shortest summary consumed by the user. 


Max NumSummary Frames 


Size of the longest/largest key frame-based summary consumed by the user 
(stated in number of frames) 


MaxSummaryDuration 


Temporal duration of the longest summary consumed by the user. 


AvgN umSummaryFrames 


Average size of the key frame-based summaries consumed by the user (stated in 
number of frames). 


AvgSummaryDu ration 


Average temporal duration of the summaries consumed by the user. 


numTotal Instances 


Total number of content items for which SummzarizationHistory information has 
been generated. 



[0375] The following example highlights instantiations of the various UsageHistory description schemes. The exam- 
ple contains UserActionHistory and UserChoiceHistory information collected for 12 hours over two days. 
[0376] The example illustrates how the functionality provided by the Action Med iaTime and ActionGeneralTime ele- 
ments are different for some user actions such as "Rewind," "FastForward" and "SlowMotion." For example, as shown 
in the example, a "FastForward" action that lasts only a few seconds in terms of general time may actually correspond 
to several minutes in terms of the media time base. Relying only on the general time to represent the time of occurrence 
and duration of an action may lead to inconsistencies and ambiguity. Thus the proposed syntax supports representation 
of ActionTime in terms of both media time and general time. 



40 
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<UsageHistory id="usage-history-001 M allowCollection="true"> 
<UserIdentifier protected- 'true" 

<UserName xmhlang- ? en">John Doe<AJserName> 
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</UserIdentifier> 

<UserActionHistory id= ,, useraction-history-001" protection-' 
<ObservationPeriod> 

^imePoin^OOO-lO-O^lSiOO-OSiOO^imePoin^ 

<Duration>PT6H<Duration> 
</ObservationPeriod> 
<ObservationPeriod> 

<TimePoint>2000-10-10T18:00-08:00</TimePoint> 

<Duration>PT6H</Duration> 
</ObservationPeriod> 
<UserActionList id="ua-list-001 " 

numlnstances= ,? 2" totalDuration= , T2H30M M > 

<ActionType><Label>Record</Label></ActionType> 

<UserAction> ... </UserAction> 

<UserAction> ... </UserAction> 
</UserActionList> 
<UserActionList id= ,t ua-list-002" 

numlnstances= ,, 25 n totalDuration="P7H02M"> 

<ActionTypexLabel>View</Labelx/ActionType> 

<UserAction> 

<ProgramIdentifier> 
<IDOrganization> 

<FreeTenn>AnIDOrg</FreeTerm> 



96 



EP 1 189 437 A2 

</IDOrganization> 
<IDName> 

<FreeTenn>AnIDNarne</FreeTerm> 
</EDName> 

<UniqueID>02-miif- 1 00900</UniqueID> 
</ProgramIdentifier> 
</UserAction> 
<UserAction> 

<ProgramIdentifier> 
<BDOrganization> 

<FreeTenn>An]DOrg</FreeTerm> 
</IDOrganization> 
<IDName> 

<FreeTerm>AnEDName</FreeTerm> 
</IDName> 

<UniqueID>02-mnf-100900</[JniqueID> 
</ProgramIdentif3er> 
<ActionDataItem> 

<href> 

www.abc.com/content/mnf71 00900/mnf- 

stream.xml#segment_l 45 

</hre£> 
</ActionDataItem> 
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</UserAction> 

</UserActionList> 
<UserActionList id="ual-003" 

numlnstances="3" totalDuration="PT4Ml 0S M > 

<ActionType><Label>FastForward</Label></ActionType> 

<UserAction> 

<ActionTime> 

<ActionMediaTime> 

<MediaTimePoint>2000-10- 

09T1 9: 10: 12</MediaTimePoint> 
<MediaDuration>PT 1 M45S</MediaDuration> 
</ActionMediaTime> 
<ActionGeneraITime> 

<TimePoint>2000-10-09Tl 9: 1 0: 12-08 :00</TimePoint> 
<Duration>PT8S</Duration> 
</ActionGeneralTime> 
</ActionTime> 
<PrograniIdentifier> 

<IDOrgani2ation> 

<FreeTerm>AnIDOrg</FreeTerm> 
</IDOrganization> 
<EDName> 
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<FreeTerm>AnIDName</F reeTerrn> 
</IDName> 

<UniqueID>02-mnf- 1 00900<UniqueID> 
</Program]dentifier> 
<ActionDataItem> 

<hre£> 

www.abc.com/content/mn^l 00900/mnf- 

streani.xml#conun_break_l 7 

</href> 
</ActionDataItem> 
<^UserAction> 
<UserAction> 

<ActionTime> 
<ActionMediaTime> 

<MediaTimePoint>2000-l 0- 
1 OT 1 8 : 1 6 : 08</MediaTimePoint> 

<MediaDuration>PTlM35S</MediaDuration> 
</ActionMediaTime> 
<ActionGeneralTime> 

<TimePoint>2000- 1 0-1 0T1 8: 1 6:08-08;00</TimePoint> 
<Duration>PT7S</Duration> 
</ActionGeneralTiine> 
</ActionTirne> 
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<ProgramIdentifier> 
<IDOrganization> 

<FreeTerm>AnIDOrg</FreeTerm> 
</IDOrganization> 
<IDName> 

<FreeTeim>AnIDNaine</FreeTerni> 
</BDName> 

<UniqueID>01 -wnpj- 1 0 1 000</UniqueID> 
</ProgramIdentifier> 
<ActionDataItem> 

<href> 

www.abc. com/content/news/ 1 0 1 000/wnpj .xml#news-item 
</hreJ> 

</ActionDataItem> 
<yUserAction> 
<UserAction> 
<ActionTime> 

<ActionMediaTime> 

<MediaTimePoint>2000-l 0- 

10T20:05:34</MediaTimePoint> 
<MediaDuration>PT 1 M</MediaDuration> 
</ActionMediaTime> 
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<ActionGeneralTime> 

<TimePoint>2000-l 0- 1 0T20:O5 :34-08:00</TimePoint> 
<Duration>PT5S</Duration> 
</ActionGeneralTime> 
<yActionTime> 
<ProgramIdentifier> 
<IDOrganization> 

<FreeTcrm>AjiIDOrg</FreeTerm> 
</IDOrganization> 
<IDName> 

<FreeTcrm>Ann5Name</FreeTercTi> 
</EDName> 

<UniqueID>03-tss-000063</UniqueID> 
</ProgramIdentifier> 
<ActionDataItem> 

<href> 

www.foxxoni/xml/that70sshow/063/tss--063.xml#break_2 
</href 
</ActionDataIteni> 
</UserAction> 
</UserActionList> 

<UserActionList id^'ual-004" numlnstances= ,, l ,, > 
<ActionType> 
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<Label>ClickThrough</Label> 
</ActionType> 
<UserAction> 

<ActionTime> 
<ActionGeneralTime> 

<TimePoint>2000- 1 0-09T 1 8:48:0 1 -08:00</TimePoint> 
</ActionGeneralTime> 
</ActionTiine> 
<ProgramIdentifier> 
<IDOrganization> 

<FreeTerm>AnEDOrg<^TreeTerm> 
</IDOrganization> 
<IDName> 

<FreeTeim>AiiIDName</FreeTenn> 
</IDName> 

<UniqueID>02-mnf- 1 00900</UniqueID> 
</ProgramIdentifier> 
<ActionDataItem> 

<hief> 

www.abc.com/content/rnnf71 00900/mnf- 
.xml#related_media_ID_12 
</hreC> 

</ActionDataItem> 



102 



EP 1 189 437 A2 



</UserAction> 
<AJserActionList> 
</UserActionHistory> 

<UserChoiceHistory id^userchoice-history-OOl" protection="false"> 
<ObservationPeriod> 

^LmePoiii^OOO-lO-OgTlSrOO-OSrOO^imePoin^ 

<Duration>PT6H</Duration> 
</ObservationPeriod> 
<ObservationPeriod> 

<TimePoint>2000-l 0- 1 OT i 8 :00-08 :00</TimePoint> 

<Duration>PT6H</Duration> 
</ObservationPeriod> 

<ClassificationHistory id— 'classification-hist-OOl " 
numTotalInstances= ,, 24"> 
<CountryHistory id="coimtry-hist-OO.r 
numlnstances=" 15" 

totalDuration="PT7H22M>iis<CountryHistory> 

<GenreHistory id =, 'genre-hist-001 M numInstances= M 7 M 
totalDuration="PT2H23M" term-" 1 .2.3" 

scheme-' AScheme"> 
<Label xmI:lang= s "en t, >Football</Label> 
</GenreHi story > 
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<GenreHistory id="genre-hist-002 ,r numlnstances="4" 

totaiDuration=="PTlH46M" tenn= ,, 2.4 M scheme="AScheme"> 
<Label xml:lang= ,, en ,, >Sitcom</LabeI> 
</GenreHistory> 

</ClassificationHistoiy> 
<CreationHistory id="creation-hist-00 1 " 
numTotalInstances= ,, 26 ,, > 

<TitleHistoiy xml:lang= n en" id^'title-hist-OOl" numlnstances=="26" 
totalDuration= , TT2H23M">Monday Night 

Football</TitleHistory> 

<CreatorHistory id="creator-hist-00r numlnstances= ,, 6" 
totalDuration= M PT56M"> 

<Role term="3.5.2" scheme= M AnotherScheme"> 

<Label>Actor</Label> 
</Role> 
<Person> 

<Name xml:lang> 

<GivenName>Jason Alexander</GivenName> 

</Name> 
</Person> 

<CharacterxGivenName>George 
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Costanza</GivenName></Character> 

</Cr eatorHi story> 

</CreationHistory> 
<SourceHistory id="source-hist-001 " 
numTotalInstances="22"> 

<PublisherHistory xsi:type= ,, OrganizationType" 
id="publisher-hist-001 " numInstances="3 M 
totalDuration= ,l PT4H42M ,, > 
<Name xml:lang= ,, en">ABC</Name> 

</PublisherHistory> 



35 </SourceHistory> 



40 </UserChoiceHistory> 
</UsageHistory> 

^5 [0377] The terms and expressions that have been employed in the foregoing specification are sued as terms of 
description and not of limitation, and there is no intention, in the use of such terms and expressions, of excluding 
equivalents of the features shown and described or portions thereof, it being recognized that the scope of the invention 
is defined and limited only by the claims that follow. 



Claims 

1 . A usage history description scheme (740) for at least one of an audio, an image, and a video comprising a plurality 
of frames comprising: 

(a) said usage history description scheme (740) containing information about a user with respect to said at 
least one of said audio, image, and video based upon previous usage of said at least one of said audio, image, 
and video; and 
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(b) said usage history description scheme (740) including at least one type defined by, at least in part, a 
thesaurus. 

2. A description scheme for at least one of an audio, an image, and a video comprising a plurality of frames comprising: 

5 

(a) said description scheme containing information about a user with respect to said at least one of said audio, 
image, and video based upon previous usage of said at least one of said audio, image, and video; and 

(b) said description scheme including at least one type characterized by, at least in part, a time associated 
with said at least one of said audio, image, and video. 

10 

3. A system including a usage history description scheme (740) for at least one of an audio, an image, and a video 
comprising a plurality of frames comprising: 

(a) said usage history description scheme (740) containing information about a user with respect to said at 
least one of said audio, image, and video based upon previous usage of said at least one of said audio, image, 
and video; 

(b) content descriptions referenced by said usage history description scheme (740); and 

(c) said system including referencing functionality permitting said usage history description scheme (740) to 
reference selected portions of selected said content descriptions. 

A system including a usage history description scheme (740) for at least one of an audio, an image, and a video 
comprising a plurality of frames comprising: 

(a) said usage history description scheme (740) containing information about a user with respect to said at 
25 least one of said audio, image, and video based upon previous usage of said at least one of said audio, image, 

and video; and 

(b) said system capturing contentfor said usage history description scheme (740) at a selectable level of detail. 

5. A usage history description scheme (740) for at least one of an audio, an image, and a video comprising a plurality 
30 of frames comprising: 

(a) said usage history description scheme (740) containing information about a user with respect to said at 
least one of said audio, image, and video based upon previous usage of said at least one of said audio, image, 
and video; and 

35 (b) said usage history description scheme (740) including data indicating whether the user's identity is intended 

to be revealed to third parties. 

6. A system including a usage history description scheme (740) for at least one of an audio, an image, and a video 
comprising a plurality of frames comprising: 

40 

(a) said usage history description scheme (740) containing information about a user with respect to said at 
least one of said audio, image, and video based upon previous usage of said at least one of said audio, image, 
and video; and 

(b) said system selectively defining the content of said usage history description scheme at multiple levels of 
45 granularity. 

7. A method of using usage history information describing at least one of an audio, an image, and a video comprising 
a plurality of frames comprising: 

so (a) providing said usage history information containing information about a user with respect to said at least 

one of said audio, image, and video based upon previous usage of said at least one of said audio, image, and 
video; and 

(b) selectively defining the content of said usage history description scheme at multiple levels of granularity 

55 8. A method of obtaining usage history information describing at least one of an audio, an image, and a video com- 
prising a plurality of frames comprising obtaining said usage history information containing information about a 
user with respect to said at least one of said audio, image, and video based upon previous usage of said at least 
one of said audio, image, and video in a manner free from explicit user input. 
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(node) has a Preference 
Value attribute.PV 
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FIG.54 
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i ) Individual preferences are 

combined into a hierarchy to 
form a preference template. 

ii ) Each individual preference 

contains a test and a preference 
value (PVa,PVb,..) 



V 



iii) EachProgram has a 
description which is 
composed of features. 

© A E 
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iv ) Evaluate each program 
against individual 
preference tests. 
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The Filter Agent walks through the 
hierarchy of individual preferences. 
The individual tests are evaluted 
and the results sre weighted 
by the preference values. 
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v ) The individual test results 
(Ra,Rb,...)are organized 
in the same hierarchy as 
the preference template. 



The Filter Agent walks through the 
hierarchy of connections between 
individual test results. Each connection 
requires a combinatorial operation. 

Sibling nodes may be combined 
in operations such as 0R f AND, 
MAX,MIN,AVG,SUM, ... 

Child nodes may be wrapped up into 
the parent node to create a composite 
result, or they may be individually 
evaluated to create a sublist at that level. 
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vi) Combine individual test results. 
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vii) Sort Program. 
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List of ranked Programs. 
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OR'ing of container 
elements with different 
number of children in each. 
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